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(57) ABSTRACT 

A switching device for forwarding network traffic to a 
desired destination on a network, such as a telephone or 
computer network. The switching device includes multiple 
ports and uses a lookup table to determine which port to 
forward network traffic over. The lookup table includes 
network addresses that are maintained in ascending or 
descending order. The switching device includes multiple 
binary search engines coupled in series including one or 
more precursor binary search engines and a final stage 
binary search engine. Together, the binary search engines 
perform an N iteration binary search. Additionally, a single 
search engine can perform multiple concurrent searches so 
that source and destination addresses can be obtained simul- 
taneously and without wasted memory cycles. 
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NETWORK SWITCHING DEVICE WITH 
CONCURRENT KEY LOOKUPS 

CROSS-REFERENCE TO RELATED 

APPLICAnON 5 

This application is a continuation of prior application Sen 
No, 09/166,707, filed Oct. 5, 1998, now U.S. Pat. No. 
6,161,144. 

This application is based on provisional U.S. patent iq 
application Serial No. 60/072^0 filed Jan. 23, 1998, and 
entitled "Forwarding Database Lookup Technique." 

HELD OF THE INVENTION 

This invention relates generally to networks, such as 
telephone and computer networks, and, more particularly, 
relates to routing iiiformation through such networks. 

BACKGROUND OF THE INVENTION 

20 

A network allows two or more parties to communicate 
with each other. In their simplest form, networks generally 
include transmission lines and switching devices (e.g., 
routers, switches, switching routers, etc.). The transmission 
lines carry signals (e.g., electrical, optical, etc.), while the 25 
switching devices are intermediate stations that establish 
temporary connections between trananission lines. In tele- 
phone networks, for example, a caller's line goes to a 
switching device where the actual connection is made to the 
called party. In computer networks, devices such as routers 30 
receive messages on the network and forward the messages 
to their correct destinations. Computer networks can be as 
small as a local area network (LAN) consisting of a few 
computers, printers, and other devices, or it can consist of 
many computers distributed over a vast geographical area 35 
(e.g., the Internet). 

An example computer network 10 is shown in FIG. lA 
The network includes two local segments 12 and 14, and 
connection to a remote network 16. Nodes, labeled as A-J, 
represent computers connected to the local segments. A 40 
switching device 20 includes three ports 22-24 and switches 
network trafiSc between segments 12, 14, and the remote 
network 16. Network 16 may also include switching 
devices, such as switching device 21, which then coimects 
other segments (not shown) to the network. Switching 45 
device 20 allows the nodes on one segment to communicate 
with nodes on other segments and to other switching 
devices. The nodes communicate with each other through a 
protocol (e.g., HTTP, TCP/IP. SMB, etc.) which allows the 
nodes to transmit and receive network frames (a network 50 
frame includes a destination address, a source address, and 
a data field). When switching device 20 receives a frame 
from a node, it analyzes the destination address by searching 
a lookup table 26, shown in FIG. IB. Lookup table 26 
includes table entries having a network address field and a 55 
port field. When the destination address is matched to a 
network address in the lookup table, switching device 20 
determines which port to forward the frame to by obtaining 
the port number corresponding to the matched network 
address. For example, if node A on segment 12 sends a 60 
message to node H on segment 14, switching device 20 
receives the message from node A and in response searches 
the entries in the network address field of lookup table 26. 
Table entry 28 contains the network address for H. A 
corresponddng port field 30 for network address H indicates 65 
that the frame should be forwarded over port 2. Additional 
background information on switches can be found in a 
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number of references, such as Fast Ethernet (1997) by L. 
Quinn et al.. Computer Networks (3^^^ Ed. 1996 by A 
Tannenbaum, and High-Speed Networking with LAN 
Switches (1997) by G. Held, all of which are incorporated 
herein by reference. 

The switching device can obtain the network addresses 
for the lookup table in different ways, depending on the 
particular implementation of the switching device. For 
example, the switching device may snoop network traflSc so 
that when a frame is received on a port, the switching device 
determines if the frame's source address is in the table and, 
if it is not, adds an entry containing the source address and 
the inbound port to the table. Thus, the switching device is 
said to "learn" addresses and port numbers from any frame 
that is transmitted by a node. Another technique some 
switching devices (e.g., routers) use to obtain the lookup 
table is from other switching devices through a special 
protocol. TTaus, routers supply network addresses to each 
other to supplement their lookup tables. 

Consequently, when a network frame is received in a 
switch, both the source and destination addresses must be 
searched in the lookup table — the source address for "learn- 
ing" and the destination address for forwarding. To search 
the lookup table, a single search engine (not shown) within 
the switch 20 sequentially accesses lookup table entries and 
compares the entries to the destination address in the net- 
work frame. After the search for the destination address is 
completed, a second independent search is performed for the 
source address. An example timing diagram 40 for the 
search engine is shown in FIG. 2. During a first clock cycle 
42, the search engine accesses the lookup table and obtains 
a network address. E>uring a second clock cycle 44, the 
search engine compares a network address obtained from the 
lookup table to the destination address. The first and second 
clock cycles together form a single iteration of the search. If 
there is no match, the search engine loads an address for the 
next lookup table entry to analyze. The process is repeated 
during clodc cycles 46 and 48 (a second iteration), and so on 
until a match is found or the search fails. After the search for 
the destination address is completed, a second search is 
performed for the source address. Unfortunately, given 
current memory and ASIC speed limitations, the above- 
described technique for searching the lookup table is too 
slow to meet the requirements of recently-developed gigabit 
switches. Additionally, with sequential searching, the lookup 
table is only accessed every other clock cycle, wasting 
valuable time. 

An objective of the present invention, therefore, is to 
provide a high-speed network switching device that can 
quickly and efficiently search through address lookup tables 
and that overcomes the limitations of the prior art. 

SUMMARY OF INVENTION 

The present invention provides a switching device (e.g., 
router, switch, switching router, telephone switch, etc.) that 
forwards network traffic to a desired destination on a 
network, such as a telephone or computer network. The 
switching device includes multiple ports and uses a lookup 
table to determine which port to forward network traffic 
over. The network traffic is typically in the form of network 
frames that include source and destination addresses. 

In one aspect of the invention, the lookup table includes 
network addresses that are maintained in sorted order (e.g., 
ascending or descending order) to facilitate binary searches. 
Additionally, multiple binary search engines are connected 
in series and perform multiple binary searches simuJta- 
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neously. Rather than having each bioary search engine 
perform an independent search, the binary search engines 
each perform a predetermined number of iterations of an N 
iteration search. Additionally, each search engine has a 
separate memory, which stores only data (nodes from the 5 
lookup table) necessary for its respective iterations of the 
search. When a search engine completes its iterations of the 
search, the results are passed to the next search engine in the 
series. The next search engine uses the results from the 
previous search engine as a starting point to performing its 10 
respective iterations of the binary search. The number of 
search engines connected in series (also called pipelining) 
can vary between two and N, where N is the number of 
iterations needed for a binary search to complete. If N search 
engines are used, each search engine performs only one 15 
iteration. 

By pipeUning search engines, increased throughput is 
achieved. Additionally, by storing only the data necessary 
for a predetermined nimiber of iterations, very little memory 
space is needed. For example, a lookup table with 64K 20 
entries requires 16 iterations (2^=64IC) to search the entire 
table using a binary search. If two search engines are used, 
each search engine performs 8 iterations each. The first 
search engine's memory only stores 256 entries (2^, while 
the second search engine stores either all 64K entries or the 25 
remainder of the entries after extraction of the 256 entries 
from the lookup table. 

Id another aspect of the invention, each search engine may 
perform concurrent source and destination searches of the 
lookup table. That is, dtiring one clock cycle, the search 30 
engine performs part of a search for the destination address 
of the network frame and duiring the next clock cycle the 
search engine performs part of a search for a source address. 
Thus, one search engine performs two independent searches 
concurrently. Moreover, while the lookup table is being 35 
accessed for the destination address, the source address from 
the previous look up is being analyzed so that a next source 
address may be calculated. Likewise, when the lookup table 
is being accessed for the source address, the previotisly 
obtained destination address is analyzed. The concurrent 40 
source and destination address lookups allow the search 
engines to search at twice the speed of search engines 
performing linear searches. Additionally, the lookup table is 
accessed during every clock cycle rather than sitting idle 
during some clock cycles. Perforniing concurrent searches is 45 
independent of the algorithm used. Thus, binary, linear, 
CAM, binary radix trees, hashing and other types of 
searches can be used. 

It will be recognized by those skilled in the art that the two 
described aspects of the invention can be performed inde- 50 
pendent of one another. That is, a switching device may have 
just one binary search engine (as opposed to multiple search 
engines in series) that performs concurrent source and 
destination address searches. Additionally, a switching 
device may have multiple search engines connected in series 5S 
wherein die search engines do not perform concurrent 
searches. Alternatively, a switching device may include both 
aspects with search engines coupled in series and with the 
search engines performing concurrent searches. 

These advantages and other advantages and features of 
the inventions will become apparent from the following 
detailed description, which proceeds with reference to the 
following drawings. 

BRIEF DESCRIPTION OF DRAWINGS ^5 
FIG. lA shows a known network containing multiple 
segments connected through a switching device. 
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FIG. IB shows a known lookup table stored in the 
switching device for identifying nodes connected to the 
segments. 

FIG. 2 shows a timing diagram of a search engine within 
the switching device of FIG. lA. 

FIG. 3 shows a switching device according to one aspect 
of the invention with two binary search engines coupled in 
series. 

FIG. 4A shows a lookup table in sorted order according to 
another aspect of the invention. 

FIG. 4B shows the lookup table of FIG. 4A in binary tree 
format and showing four iterations of a binary search. 

FIG, 4C shows a memory configuration for a switching 
device having four search engines with each search engine 
performing one of the iterations of FIG. 4B. 

FIG. 4D shows search results that are passed between 
search engines from binary searches performed on the 
lookup table of FIG. 4A. 

FIG. 5 shows a flowchart of a method for forwarding 
network frames in the switching device of FIG. 3. 

FIG, 6 shows a flowchart of a method fiar performing 
different iterations of a binary search in separate binary 
search engines. 

FIG. 7 shows a detailed circuit diagram of a binary search 
engine according to another aspect of the invention where 
source and destination searches are performed concurrently. 

FIG, 8A shows a flowchart of a method for concurrently 
performing source and destination searches using the binary 
search engine of FIG. 7. 

FIG. 8B is a timing diagram for the binary search engine 
of HG. 7. 

FIG. 9 shows another example of a lookup table using 
pointers to access the network addresses. 

DETAILED DESCRIPTION OF AN 
ILLUSTRATIVE EMBODIMENT 

FIG. 3 shows a switching device 50 having a port 52 and 
port intercommunication logic 54, Port 52 includes a media 
interface 56, a primary memory 58, and a search engine 60. 
The search engine 60 includes temporary packet storage 62, 
packet analysis and key extraction logic 64, two internal 
binary search engines 66, 68, a first stage memory 70, and 
forwarding decision logic 72. There are multiple ports (not 
shown) in switching device 50. One or more ports are 
located on channel cards (not shown) mounted in a chassis. 
The number of ports and how the ports and port intercom- 
munication logic 54 are mounted within a chassis are based 
on the particular application and are not important to the 
invention. 

Media interface 56 cotmects switching device 50 to a 
network (not shown) through a network cable 74. The 
network cable can take a variety of forms (e.g., fiber optic, 
twisted-pair, coaxial, etc.) depending on the type of network. 
A variety of network standards and protocols may be used, 
such as TCP/IP, IPX/SPX, FDDI, ATM, ETHERNET, 
GIGABIT ETHERNET, FAST ETHERNET, Token Ring, 
SONET, 100-base TX, etc. Other network protocols, 
standards, and network cables now existing or later devel- 
oped may be used with the invention, since these particular 
aspects are not important to the invention. Media interface 
56 is a communication link between search engine 60 and 
the network. Thus, media interface 56 aUows search engine 
60 to send network frames in any desired format and media 
interface 56 reformats the frames for the particular network. 
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Similarly, media interface 56 receives network frames from 
the network and formats the frames so they may be read by 
search engine 60. The media interface iised is based on the 
particular application and is not important to understanding 
the invention. 

Temporary packet storage 62 within search engine 60 
holds the network frame temporarily while other compo- 
nents within the search engine determine where to forward 
the network frame to, as is further described below. In some 
circumstances, a decision may be made to not forward the 
packet at all. In such cases, the network frame is never 
passed to the port intercommunication logic. Depending on 
the size of the network frame, the temporary padcet storage 
62 may hold several network frames, or, alternatively, only 
a portion of a network frame as it is forwarded to another 
port. 

Packet analysis and key extraction logic 64 extracts the 
source and destination addresses from the network frame 
and forwards the addresses to the first binary search engine 
66. The packet analysis and key extraction logic 64 may also 
pass additional information to the binary search engine 66, 
such as virtual LAN information that comes with the net- 
work frame or is derived based on the type of network frame. 
This additional information is appended to the source and 
destination addresses to form a key. One skilled in the art 
will recognize that the key may include any desired 
information, depending on the particular application. For 
example, the key may contain only the destination address, 
or only the source and destination addresses, or the source 
and destination addresses and additional information. For 
simplicity, the key is described generically below as being 
the source and destination addresses. 

Binary search engines 66 and 68 are coupled to respective 
memories 58 and 70. The memories together store a lookup 
table that the binary search engines use for analyzing 
network frames received from media interface 56. A portion 
of an example lookup table is shown in FIG. 4 and is further 
described below. A management processor (not shown) is 
coupled to search engines 66, 68 and maintains the lookup 
table in memories 58, 70. Specifically, the management 
processor may direct the search engines to delete old table 
entries, insert new table entries and generally maintain the 
table in sorted order so that the search engine 60 performs 
searches efficiently and correctly. 

As is described in detail below, binary search engines 66, 
68 perform a binary search on the lookup table. A binary 
search is a technique for searching a sorted table where 
successive midpoints of the table are determined and com- 
pared against a search term. Thus, to start the search the 
midpoint of the table is compared against the search term. 
By using the midpoint, the table is effectively divided into 
two parts and a determination is made as to which of the two 
parts the search term must reside in. Only the part that has 
the search term is analyzed and the other part is ignored. The 
midpoint of the pertinent part is then determined to divide 
the table into two more parts, and so on until the key is found 
or the search fails. Binary searches are understood in the art 
and are described in a text entitled Design and Analysis of 
Algorithms by Jeffrey D. Smith, 1989. Binary searches 
require a predetermined number of "iterations** to complete. 
The number of iterations depends on the size of the lookup 
table and follows the formula log2N=number of iterations, 
where N is the number of entries in the table. Thus, a 64K 
table requires 16 iterations to complete a binary search, 
while a 256 entry table requires 8 iterations to complete. 

One aspect of the invention is that the binary search 
engines divide the binary search of the lookup table by each 
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performing some of the iterations of the overall search. For 
example, if the lookup table has 64K entries, the binary 
search engine 66 performs the .first eight iterations of the 
search and binary search engine 68 performs the last eight 

5 iterations. Additionally, the first stage memory 70 does not 
contain the entire bokup table. Instead, it only contains 256 
entries needed for the first eight iterations of the search. The 
256 entries are determined and copied from the lookup table 
and include the successive midpoint combinations from the 

JO lookup table needed for the fiiit eight iterations. Once the 
binary search engine 66 completes its eight iterations, the 
results are passed to binary search engine 68. Binary search 
engine 68 then uses the results as a starting point for its eight 
iterations of the binary search. A substantial amount of 

J 5 memory is saved by only using successive midpoint com- 
binations in the memory 70. Additionally, although two 
search engines are shown, any desired nimiber of search 
engines may be used. For example, for a 16 iteration search, 
16 binary search engines can be used with each search 

2Q engine performing one iteration. Alternatively, 4 binary 
search engines may be used with each binary search engine 
performing 4 iterations. Still further, the partitioning of 
iterations across search engines need not be equal. Also, the 
lookup table can be any desired length. Consequently, the 

25 binary search engines can perform any number of iterations 
depending on the particular application. 

After binary search engine 68 completes the binary 
search, the results are passed to forwarding decision logic 
72, which examines the results and applies a predetermined 

30 set of rules to determine whether the network frame shoiild 
be forwarded and ^ich port or ports it should be forwarded 
to. Forwarding decision logic may also examine the level of 
priority of the network frame. Higher priority frames are 
typically forwarded by the switching device 50 before lower 

35 priority frames. 

When the forwarding decision logic 72 determines that a 
frame is to be forwarded to other ports in switching device 
50, it passes the network frame to the port intercommuni- 
cation logic 54. Port interconomunication logic 54 includes 

40 a switch fabric 76 and a switch fabric control 78. Switch 
fabric 76 can take a variety of forms. For example, the 
switch fabric can be a cross-bar switch, which is commonly 
used in telecommunications switching. The cross-bar switch 
creates a path between a receiving port and a transmitting 

45 port so that the networic frame may be passed therebetween. 
A wide variety of cross-bar switches may be used, such as 
cut-through switches, interim cut-throtigh switches, and 
store -and-forward switches. Other types of switch fabrics 
may also be used. For example, switch fabric 76 may also be 

50 a central memory using a bus arbitration device and a central 
bMS. Using a shared-memory bus architecmre, all ports 
access a central located memory pool The ports can access 
the central memory through a common bus wdien an arbi- 
tration device grants access. Another possible switch fabric 

55 that can be iised is a parallel access shared-memory archi- 
tecture. In a parallel access shared memory, all ports share 
a central memory location. However, a bus arbitration 
scheme is not used. Instead, every port has a dedicated path 
into and out of the central memory fabric. Therefore, all 

60 ports can simultaneously access the centralized memory 
pool at any time. A wide variety of existing switch fabrics or 
later developed switch fabrics may also be used. The par- 
ticular switch fabric and switch fabric control is not of 
importance to the invention. 

65 Switch fabric control 78 controls network frames as they 
are passed through the switch fabric. In the case where the 
switch fabric is a cross-bar switch, the switch fabric control 



06/25/2003, EAST Version: 1.03.0002 



us 6,453,358 Bl 

7 8 

is typically called a scheduler. The scheduler establishes a of data entries. The first portion 106 can similarly be divided 

connection within the cross-bar switch so that a search into two portions 110, 112 with a midpoint at address OlOOB. 

engine on one port can directly pass a network frame to a The second portion 108 also can be divided into two portions 

search engine on another port. In the case where the switch 114, 116 with a midpoint at address llOOB. By determining 

fabric is a memory, the switch fabric control teUs a receive- 5 successive midpoints, a binary search tree can be generated, 

side search engine where to store the frame in memory. After nc. 4B shows a binary search tree 118 for the lookup 

the frame is stored in memory, the switch fabric control table 100 of FIG. 4A. The search tree shows four iterations, 

signals a transmitting-side port that the network frame is which are the maximum nimiber of iterations required for 

ready to be transmitted and provides the address of the any search on a 16 entry table (i.e., 2^^16). The tree is 

memory location where the frame in located. The switch 10 formed by determining the successive midpoints, called 

fabric control may also provide priority information to the nodes, in the lookup table as described in relation to FIG. 

transmitting-side port. 4A. To perform a binary search, the search term is compared 

FIG. 5 shows a flow chart of a method used by the against the network address located at the midpoint address 
switching device 50 for forwarding network frames. In step lOOOB of the lookup table (i.e., 26). If the search term is less 
80, the media interface 56 receives a network frame from the 15 than 26, it is compared to 14. Alternatively, if it is greater 
network and passes the frame to search engine 60. The than 26 it is compared to 38, and so on, until a match is 
network frame contains a destination address that indicates found or the search fails. Each comparison that is performed 
the ultimate destination for the network frame. In step 82, is another iteration of the binary search, 
search engine 60 searches memories 58, 70 to determine FIG. 4C shows that the different iterations of the binary 
whether the destination address from the network frame is 20 search may be divided amongst 4 diflferent binary search 
located within the lookup table. If a network address engines connected in series. A memory 120 for a first search 
matches the destination address, a port that the frame should engine only requires one data entry (one node) £ar the frrst 
be forward to is also obtained from the lookup table. iteration of the binary search, which represents the midpoint 
Assuming that the destination address was properly found in of the lookup table. A memory 122 for a second search 
the lookup table and the appropriate port was determined, 25 engine requires two data entries for the second iteration, 
the search engine passes the search results to switch fabric which represent the midpoints of portions 106 and 108 of the 
control 78 (step 84). In step 86, search engine 60 transfers lookup table 100 (See FIG. 4A). A memory 124 for a third 
the network frame through switch fabric 76. If the switch search engine contains additional successive midpoints of 
fabric is a cross-bar switch, the switch fabric control 78 the lookup table needed for the third iteration. And a 
estabhshes the connection in the switch fabric and commu- 30 memory 126 for a fourth search engine contains the remain- 
nicates to search engine 60 when to send the network frame. ing data entries of the lookup table. The fourth memory 126 
If the switch fabric is a central memory with or without bus may also contain the entire lookup table 100. The memories 
arbitration, the switch fabric control tells search engine 60 120, 122, and 124 (called the precursor memories) may 
where in the switch fabric to store the network frame. Search include only the network address field of the lookup table, 
engine 60 then stores the network frame at the indicated 35 without any payload fields. Alternatively, the precursor 
location. The switch fabric control also informs the other memories 120, 122, and 124 may include the payload 
ports of where the network frame is stored so that they may information so that the lookup table is effectively distributed 
properly access it within the switch fabric. Regardless of the amongst many memories, even though it is a unitary data- 
technique used with the switch fabric, the network frame is base. In any event, the precursor memories contain succes- 
obtained from the switch fabric and transmitted on one or 40 sive midpoint possibilities (nodes) for the different iterations 
more ports (step 88). of a binary search and the nodes are stored adjacent to one 

FIG. 4A shows an example lookup table 100 having 16 another. The precursor memories could contain additional 

data entries, OOOOB through 1111 B. Each table entry table entries other than just the successive midpoint 

includes two or more fields including, an address field 102, possibilities, if desired. 

illustrated as a network address, and a forwarding informa- 45 FIG. 4D shows an example of data passed between the 

tion field 104, illustrated as a port number. Although a four search engines of FIG. 4C in searching for the network 

network address field is shown, other addresses used in address 17 (the key) located at address 0101 in the lookup 

switching devices may be used. The forwarding information table (see FIG. 4A). As comparisons are made during 

field also may contain information other than the port iterations of the binary search, the final address of 0101 is 

number. For example, the forwarding information field may 50 generated. That is, the first search engine generates the most 

analyze information related to the protocol format (e.g., significant bit, the last search engine generates the least 

MAC) that the address is in to determine which port to significant bit, and the intermediate two bits are generated by 

forward the network frame to. The network address field 102 the intermediate search engines. When a comparison is 

contains network addresses in sorted order (e.g., ascending made, if the key is less than the network address obtained 

or descending order). The port ntunber field 104 contains the 55 from the lookup table, a zero is passed to the next search 

port associated with the network address field. One or more engine. Alternatively, if the key is greater than the network 

of these data fields may be deleted, or alternatively, addi- address, a one is passed to the next seardi engine. Thus, with 

tional fields may be used. Additional fields may, for the key being 17, the first search engine compares 26 (the 

example, show a priority status associated with the network first iteration) and 17. Since 17 is less than 26, a 0 is passed 

address. The network address field 102 identifies a destina- 60 to the next search engine in series as indicated at 128. The 

tion to transmit the network frame to, such as a computer on 0 represents the most significant bit of the final address 

a network. The network address field is used as a key for where the network address 17 resides in the lookup table, 

accessing the other fields (often called a payload) in a table The next search engine then compares 17 against 14, since 

entry. Address OOOOB is empty so that the lookup table 38 is no longer a possibility. Since 17 is greater than 14, the 

contains an odd number of entries. The midpoint of the table 65 next bit in the address is a 1. Consequently, a 1 is appended 

is at address lOOOB and the lookup table can be equally to the most significant bit and a 01 is passed to the next 

divided into two portions 106 and 108 with equal numbers search engine as indicated at 130. The next search engine 
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then compares 17 and 20. Since 17 is less than 20, a 0 is nodes (i.e., successive midpoints) of the lookup table in the 

appended to the previous passed address resulting in a 010 case of a binary search. In the case, of a linear search, the 

being passed to search engine 4, as indicated at 132. The last next address is simply the previous address plus one. Thus, 

search engine must then compare 17 against 17 to obtain an the logic circuits 172, 174 perform different functions based 

address of 0101, which is the appropriate address in the 5 on the application. The outputs of logic circuits 172, 174 are 

lookup table (See FIG. 4A at 134). coupled to respective SA and DA address registers 176, 178. 

FIG. 6 shows a flowchart of a method for performing The outputs of the address registers 176, 178 are coupled to 

binary searches as described in FIGS. 4A-4D. In step 136, multiplexer 180 and are also fed back into next address 

muhiplc search engines are coupled in series. WtuaUy any calculation logic 172, 174, respectively. The multiplexer 180 

number of search engines may be used. For example, FIG. 10 ^ ^o^P^^ ^o a clock Une 182 that switches a multiplexer 

3 shows two search engines, while FIG. 4C describes using output so that the contents of the SAaddress register 176 and 

four search engines. TTie invention can easily be extended to DA address register 178 are passed to memory 162 m an 

eight, sixteen, or any other number of search engines. Each alteniatmg fashion. The contente of the SA aud DA address 

selrch engine that performs iterations of the binary search '^^f'!^ ^^^^ 17«/ontam an address of a lookup table entry 

. y . ^^o\ ' £ and when applied to memory 162 causes the memory to 

has Its own separate memoir (step 138). THe memories for ^5 output data ^Le., a network address) on a data output 184. 

the search engmes store nodes (the network address field Qock 182 is also coupled to both data registers 168, 170 and 

with or without the payload mformation) needed for that synchronized so that data is latched into source data 

search engine to perform its respective iterations of the register 168 when the multiplexer 180 passes the contents of 

binary search (step 140). The nodes are the successive samx. address register 176. Similarly, data register 170 

midpoint possibilities of the lookup table in binary tree 20 latches the data output 184 of memory when the address 

format. In step 142, a first search engine in the scries stored in DA address register 178 passes through the mul- 

performs a predetermined number of iterations of the binary tiplexer 180, Search termination logic 185 determines when 

search. The results of the iterations are then passed to the a search is complete by analyzing outputs of the address 

next search engine in the series (step 144). In step 146, a registers 176, 178 and the comparators 164, 166. An output 

check is performed to see whether the results are being 25 of the search termination logic 185 can be passed to the 

passed to the last search engine in the series. If not, then forwarding decision logic 72 (see FIG. 2) or to another 

steps 142 and 144 are repeated. If yes, then the final search engine connected in series to signal when a search is 

iterations of the binary search are performed and the results complete and to pass the results of the search, 

are passed to the forwarding decision logic (step 148). FIG. 8B shows a timing diagram for the circuit of FIG. 7, 

Although a wide variety of binary search engines can be 30 A clock signal 186 is at twice the frequency of the clock on 

used in the method and apparatus described in FIGS. 1-6, a clock line 182. Three cycles 188, 190, and 192 of clock 

particular search engine is shown in relation to HGS. 7, 8A, signal 186 are shown. During a first clock cycle 188, the SA 

and 8B. This search engine can also be used independently address register 176 (FIG. 7) is already loaded with a lookup 

of the method and apparatus of FIGS. 1-6, In particular, the table address and the multiplexer 180 passes that address to 

search engine can be used with searching techniques other 35 the memory 162. Consequently, memory 162 outputs data (a 

than binary searches including linear searches, CAM, binary network address) on data output 184 corresponding to the 

radix trees, hashing, etc. address in the SA address register. At the end of cycle 188, 

Prior search engines perform a search for the source the source data register 168 is loaded with the data from the 

address from the network frame and, only when the source memory 162. Also during cycle 188, logic circuit 174 is 

address search is completed, perform a search for the 40 calculating the next lookup table address to analyze for the 

destination address (See FIG. 2). The present invention, by destination address search and the results arc stored in DA 

contrast, performs both source and destination address address register 178. Clock cycle 188 represents the start of 

searches concurrently. FIG. 7 shows a search engine 160 a source address search iteration. 

coupled to a memory 162 containing a lookup table. The Ehiring clock cycle 190, comparator 164 compares the SA 

search engine 160 includes a source address comparator 164 45 search key and the SA data register 168. The next address 

and a destination address comparator 166. The source calculation logic 172 determines the next lookup table 

address comparator 164 has one input coupled to a source address to analyze for the search, and the results are loaded 

address (SA) search key. This SA search key is the source into the SA address register 176. Also during clock cycle 

address obtained from the network frame and passed to the 190, the output of the DA address register is passed through 

binary search engine 160 by the packet analysis and key 50 mult^ilexer 180 and the corresponding data from memory 

extraction logic 64 (see FIG. 3). A second input to com- 162 is latched into DA data register 170. Clock 190 repre- 

parator 164 is coupled to a source address data register 168 sents the start of a destination address search iteration. At the 

that contains a network address most recently obtained from end of clock cycle 190, the source address search iteration 

memory 162 during a previous iteration of the source is completed. Thus, clock cycles 188, 190 represent a 

address search. The destination address comparator 166 also 5S complete source address search iteration in a search requir- 

has two inputs with one coupled to a destination address ing multiple iterations. 

(DA) search key obtained from the network frame and the During clock cycle 192, another source address search 

other input coupled to a destination address data register 170 iteration is started and the destination address search itera- 

containing a network address obtained from the memory 162 tion is completed. The comparator 166 compares the DA 

during a previous iteration of the destination address search. 60 search key and the output of the DA data register 170. The 

The output of the comparators 164, 166 are coupled to next address calculation logic 174 calculates the next lookup 

respective next address calculation logic circuits 172, 174. table address to examine and loads the calculated address 

The next address calculation logic circuits 172, 174 use the into the DA address register. Thus, during the three clock 

lookup table address from the previous search iteration and cycles 188, 190, and 192 both a source and destination 

theresultsof the comparators 164, 166 to determine the next 6S address search iterations are performed concurrently, 

lookup table address to analyze. The logic circuits 172, 174 Additionally, the source and destination address searches are 

make such a determination of the next address based on the independent of each other. 
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The hardware of FIG. 7 can be modified while maintaia- 
ing the same timing diagram of FIG. 8B. For example, the 
multiplexer 180 can be removed and the address registers 
176, 178 can have tristate outputs with the clock line 182 
coupled to the address registers 176, 178. Thus, the clock 
switches the address registers 176, 178 in alternating fashion 
onto a common bus attached to memory 162. Other modi- 
fications can also be made to the hardware, but the general 
timing diagram of FIG. 8B should remain the same. 

FIG, 8A is a flowchart summarizing the steps for concur- 
rently performing source and destination address searches. 
During a first clock cycle, a network address is obtained 
from the lookup table as a first phase of a source address 
iteration (step 194). During a second clock cycle (step 196), 
the network address is compared to the SA search key The 
next address calculation logic 172 reads the comparator 164 
and determines what lookup table address should be 
accessed next. Also during the second cycle, the lookup 
table is accessed for the next network address needed for a 
first phase of a destination address iteration and the source 
address iteration is completed. Finally, during a third cycle 
(step 198), the destination address iteration is completed by 
resolving the next network address to analyze for the des- 
tination address search. Also, the first phase of the source 
address iteration is repeated. Although only three clock 
cycles are shown, the clock is continuous. 

A wide variety of searching techniques can be used with 
the concurrent search embodiment including binary, linear, 
CAM, binary radix trees, hashing and other types of 
searches. 

FIG. 9 shows another example of a lookup table 200 that 
may be used according to the invention. The lookup table 
includes a list of pointers 20(2 and network address data 204 
stored in the memory within the switching device. The 
pointers point to the network address data 204. An example 
of actual memory addresses is shown in column 206 for 
illustrative purposes. Notably, neither the network addresses 
204 nor the list of pointers 202 are stored in sorted order. 
However, the contents of data pointed to by the fist of 
pointers 202 are stored in sorted order. For example, a 
pointer 208 points to address 107, which contains network 
address 159:22:0. A next pointer 210 in the list points to 
address 108 which contains the network address 159:22:99. 
The next pointer 212 points to address 106 which contains 
the network address 159:66:123. Although the list of point- 
ers 208, 210, and 212 are not in order (since the pointers are 
107, 108, 106, respectively) the data pointed to by the 
pointers is in order since addresses 159:22:0, 159:22:99 and 
159:66:123 are in ascending order. 

Thus, the lookup table according to the invention does not 
need to have data stored in contiguous addresses. Instead, 
network addresses may be in one memory location and 
forwarding information (such as port numbers) may be 
stored in other locations of memory. Pointers or some other 
technique for linking associated data is then used. 

Having described and illustrated the principles of our 
invention with reference to preferred embodiments thereof, 
it wiU be apparent that these embodiments can be modified 
in arrangement and detail without departing fi-om the prin- 
ciples of the invention. 

For example, although the port is shown as including 
separate components, such components can be formed in a 
single integrated circuit. Additionally, other circuit 
components, such as the port intercommunication logic 54 
can also be included in the same integrated circuit as the port 
52. 
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Additionally, any of the components of the switching 
device can be performed by hardware, software, or a com- 
bination thereof. The invention should not be limited to the 
particular technique (whether hardware or software) for 
carrying out the methods and apparatus described herein. 
For example, the search engine can be a microprocessor 
running software or an ASIC where the searches are per- 
formed in hardware. 

Further, although the seardi engines are generally shown 
performing equal iterations of binary searches, it is not 
necessarily so. That is, the search engines can perform 
unequal iterations of searches. For example, in a 16 iteration 
search, a first search engine can perform 9 iterations and a 
second search engine can perform 7 iterations. Other varia- 
1^ tions in the number of iterations can easUy be applied and 
depends on the particular application and number of search 
engines used. 

Still further, although the search engine of FIG. 7 shows 
two comparators and two next address calculation logic 
circuits, one of each may be used instead, with the clock 
controlling whether the SA address register or the DA 
address register is read by the next address calculation logic 
circuit. Additionally, the clock controls the single compara- 
tor so that only one of the DA data register and SA data 
register are compared at a time to one of the SA and DA 
search keys. 

StiU yet further, although the lookup table is shown as 
having all memory locations filled, in practice this is not 
generaUy the case. Instead, the memory has only a portion 
filled with lookup table values and the remaining portion is 
padded with dummy values. For example, the table of FIG. 
4Amay have only 9 lookup table entries with the remaining 
memory locations containing dummy values. In the case of 
an ascending table, the dummy values are a number sufQ- 
ciently large enough that no network address can be con- 
fused with them. In the case of a descending table, the 
dummy values are a number sufBciently small enough that 
no network address can be confused with them. 
Additionally, the lookup table of FIG. 4A contains only 16 
entries for illustration. In practice, lookup tables contain 
hundreds of thousands of entries. 

Still fiirther, although the lookup table is shown as con- 
taining table entries with multiple fields, the lookup table 
may be a list of pointers, and the pointers point to the 
network addresses and other fields. Additionally, the lookup 
table can be any type of data structure or array that store 
data. 

Yet further, the network packets can include any kind of 
data including video images, voice data during a phone call, 
a document, etc. 

Still fiirther, when a port receives a network frame, it can 
analyze the destination address (at layer two and layer three) 
and it can also analyze layer 4 policy information if desired. 

Still yet further, the present invention may be applied to 
LAN'S, WAN's, the Internet, Intranets, telephone networks, 
or any other network. 

Additionally, although the lookup table is described as 
containing network addresses, the lookup table may include 
other keys. A key may include a network address exclusively 
or in combination with additional information. Alternatively, 
a key may include lookup information other than network 
addresses. Additionally, the switching device may receive 
network frames that include search keys. Alternatively, the 
search key may be derived from the network frame. For 
example, the search engine may use characteristics of the 
network frame, such as what port it was received on or what 
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protocol format it is in, to derive the search key. The search the second memory only includes a second portion of the 

key, which is either explicitly included in the network frame lookup table necessary to perform the second portion of the 

or derived therefrom, is compared to the lookup table which N iterations. 

includes lookup keys. 7. The network switching device of claim 1, wherein the 

In view of the many possible embodiments to which the 5 first and second search engines are binary search engines, 

principles or invention may be applied, it should be recog- 8. In a network switching device, a method for searching 

nized that the illustrated embodiment is only a preferred a lookup table comprising the steps of: 

example of the invention and should not be taken as a receiving a data packet including a first key and a second 
limitation on the scope of the invention. Rather, the inven- 

tion is defined by the following claims. We therefore claim *. , /. , . , t i t 

as the invention all such embodiments that come within the extracting the fiist and second keys from the daU packet; 

scope of these claims. performing by a first search engine a search of a first 

We claim: memory for the first key; 

1. A network switching device comprising: performing by the first search engine a search of the first 
a first search engine; memory for the second key concurrently with the 
a first memory coupled to the first search engine; search for the first key; and 

a second search engine; passing results of the search for the first and second keys 

a second memory coupled to the second search engine to ^ second search engine coupled to the first search 

characterized in that the first search engine performs a 20 engine for concurrently searching the second memory 

search of the first memory for a first key concurrently for the first and second keys. 

with a search of a second key and passes results of the 9. The method of claim 8, wherein the first key is a source 

search for the first and second keys to the second search address and the second key is a destination address, 

engine for concurrently searching the second memory 10, The method of claim 8, wherein each search engine 

for the first and second keys. 25 accesses its respective memory during successive clock 

2. The network switching device of claim 1, wherein the cycles and alternates between accessing the memory for 
first key is a source address and the second key is a search of the first key and search of the second key. 
destination address, 11. The method of claim 8, wherein each search engine 

3. The network switching device of claim 1, wherein each performs iterations of searches of the first and second keys 
search engine accesses its respective memory during sue- 30 in three clock cycles, wherein during at least one of the three 
cessive clock cycles and alternates between accessing the clock cycles an iteration of the search for the first key is 
memory for search of the first key and search of the second completing while an iteration of the search for the second 
key. key is beginning such that there is an overlapping between 

4. The network switching device of claim 1, wherein each execution of the iterations of the searches of the first and 
search engine performs iterations of searches of the first and 35 second keys. 

second keys in three clock cycles, wherem during at least 12. The method of claim 8, wherein the first and second 

one of the three clock cycles an iteration of the search for the search engines together perform a search of N iterations, and 

first key is completing while an iteration of the search for the wherein the first search engine performs a first portion of the 

second key is beginning such that there is an overlapping N iterations and the second search engine performs a second 

between execution of the iterations of the searches of the 40 portion of the N iterations. 

first and second keys. 13. The method of claim 12, wherein the first memory 

5. The network switching device of claim 1, wherein the only includes a first portion of a lookup table necessary to 
first and second search engines together perform a search of perform the first portion of the N iterations and the second 
N iterations, and wherein the first search engine performs a memory only includes a second portion of the lookup table 
first portion of the N iterations and the second search engine 45 necessary to perform the second portion of the N iterations, 
performs second portion of the N iterations. 14. The network switching device of claim 1, wherein the 

6. The network switching device of claim 5, wherein the first and second search engines are binary search engines, 
first memory only includes a first portion of a lookup table 

necessary to perform the first portion of the N iterations and ♦ ♦ ♦ ♦ ♦ 
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ABSTRACT 



In order to transparently red irect an HTTP connection 
req uest that is directed to an origin serve r (l{yf)to a proxy 
cache (110-1), a^MPxyjedirect or (104) translates the de sti- 
n ation address of packets directed Jo^t he origin server to the 
a ddress of the prox y. During a handshaking procedu re, a 
TCEJco nnection is transparently established between t he 
cUent (110-1) aAd.thep^ox vcache. When the client ^ transm its 
a GET re q^uest to^wh at it thinks is the orip ; in server, whi ch 
req uest spedfle s T he~com^^ of an^ob jegt"at" tha t 

onSn^rver that itwants a copy of, the proxy redirectbr 
mocfifies the complcfe^ddress specified in tbat.GET reques t 
b efore it is sent to the proxy cache . Specifically, the IP 
a ^ress of the ong n server ^ found in the destinatioji.fLeld in 
tb^gLheadSl Zfthe oneior,more,packets from the chent 
c ontaining the GET reQuestis ad ^ d^by the^proxy redi rector 
asa^prelix to tlbe co mpletcCyR t he GETlg q ugUo form 
anTab^lut^ y^Eniic proxy ca che determines m>m t hat*^ 
a ^fct?jL[RL^ it has.the ^ ^ c qucstc d^^i ect stored in 
^^..tachc. If it docs, i t sends A ^bject ^back to Ihc prox y 
r edita Et pr. which masquera des tho ^ packe ts as coming from 
t he ori g in serve r j ~ 



: transiatatpg their destiaation add ress to 

the address of the client and their sourcTlSSfess tolhat of 
the origin j;erver. If the proxy does nolThave the requested 
object,.a separate TCT^c onnectioQ js ^established between the 
prox y and the origin scnrc r from where the object is 
ret rieved a nd the n forwarded_o xciL.the_TCP_connection 
be twee n the^ent an d jhe prox)[jn order.to account for the 
addUion3l.D umS,rj3f-b,Ytes^inJh e_GEXrequest,,an-ackDowl- 
edgement sequence- numb^rjn . paclcets returned from the 
proxy Ih'at'ldgically'fol low receipt of the GET request are 
d ecremented by that number by the proxy rediiector before 
being^^vw^ed tqjhe^ Similarly, a"sequence number \ 
in paclreS^ansmitted by the chent subsequent to the GET 
request are incremented by that number before being for- 
warded by the proj^y-iedirc gtor to the pjo xy cache. — ^ 
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METHOD AND APPARATUS FOR 
TRANSPARENTLY DIRECnNG REQUESTS 
FOR WEB OBJECTS TO PROXY CACHES 

HELD OF THE INVENTION 

This invention relates to packet-switched computer 
networks, and more particularly, to a method and apparatus 
in such a network for transparently intercepting client web 
requests and redirecting them to proxy caches. 

BACKGROUND OF THE INVENTION 

Proxy caching is currently used to decrease both the 
latency of object retrieval and trafBc on the Internet back- 
bone. As is well kQ0^iiJLLa4)K)X5L£a^e has stored a copy 
of an object trom anorigin server that has'Been-ccqilfealed by 
a client, the rprjiiftRtpH rh\p^\ is Rupplj ed to the client from 
the proxy cac H ^^ther than frnm thf* Qrip;iD_server. This, 
therefor e, obviates the need to send the request over a wi de 
area netwo rjcTsuch as the Internet, to t he orig i q server w here 
the Qii ^arobject is stored and the respon^ yg rransrpi^^^'^n 
of a copy of the regneyiitrrf^ oKjprT hanV nver the network to 
th e requesting client. 

Direction of a request from a client to a proxy cache to 
determine whether a requested copy of an object is stored in 
the cache can be accomplished either tran^arently or non- 
transparently to the client. Non-transparent redirection is 
accomplished through the client's browser program which is 
configured to send all object requests to a designated proxy 
cache at a specified address. Generally, a browser can be 
configured to send all of its client requests to a designated 
proxy cache if the client is connected on a Local Area 
Network (LAN), or on an Intranet behind a firewall, where 
a proxy cache associated with that LAN or Intranet is 
located. When clients are served by a large Internet Service 
Provider (ISP), however, it is not advantageous from the 
ISP's standpoint to allow its subscribers to set their browsers 
to a specific proxy cache associated with the ISP. A large ISP 
likely will have many proxy caches in several locations and 
will thus want to maintain control over which of its several 
particular proxy caches a client request is directed. Further, 
if a proxy cache whose address is statically set in a client's 
browser becomes inoperative, all client requests will fail. 

It is therefore more desirable from an ISP*s standpoint 
with respect to latency and minimizing traffic onto and oE of 
the network to transparently intercept a client's web request 
and send it to one of its operative proxy caches to determine 
whether a copy of the requested object is stored there. If a 
copy of the requested object is then found to be stored in that 
proxy cache, a copy of the object is sent to the client, which 
is unaware that it has been served an object from the proxy 
cache rather than from the origin server to which it made the 
request. If the proxy cache does not hold a copy of the 
requested object, then a separate connection is established 
between the piroxy cache and the origin server to obtain a 
copy of the object, which when returned to the proxy is sent 
to the client over the connection established between the 
client and the proxy. 

When a client specifies a URL of the object it is requesting 
a copy of, a Domain Name Server (DNS) look-up is per- 
formed to determine from the URL an IP address of an origin 
server which has that requested object. As a result of that 
look-up, an IP address is returned to the cfientof one of what 
may be several substantially equivalent servers that contain 
that object. The client then establishes a TCP connection to 
that server using a three-way handshake mechanism. Such a 
connection is determined at each end by a port number and 
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an IP address. First, a SYN packet is sent from the client to 
that origin server, wherein the destination IP address speci- 
fied in the packet is the DNS-determined IP address of the 
origin server and the destination port number for an HTTP 

5 request is conventionally port 80. The source IP address and 
port number of the packet are the IP address and port number 
associated with the client. The client IP address is generally 
assigned to the client by an ISP and the client port number 
is dynamically assigned by the protocol stack in the client. 
The origin server then responds back to the client with an 
ACK SYN packet in which the destination IP address and 
destination port are the client's IP address and port number 
and the packet's source IP address and port niunber are the 
server's IP address and the server's port number, the latter 
generally being port 80. After receipt of the ACK SYN 

^5 packet, the client sends one or more packets to the origin 
server, which packets include a GET request. The GET 
request includes a complete URL, which identifies to that 
server the specific object within the origin server site that the 
client wants a copy of. Unlike an absolute URL, which 

20 includes both site information (e.g., www.yahoo.com), and 
object information (e.g., index.html), a complete URL only 
identifies the particular object (e.g., index.html) that is 
requested since the packet(s) containing the GET request is 
sent to the proper origin server site by means of the 

25 destination address of the packet(s). 

When a browser is configured to non-transparenlly send 
all requests to a proxy, a GET request is formulated by the 
browser that includes the absolute URL of the requested 
object. That absolute URL is then used by the proxy to 

30 establish a separate TCP connection to the origin server if 
the proxy does not have a copy of the requested object in its 
cache. The proxy requires the absolute URL since the 
destination address of the packets to the proxy is set by the 
browser to the IP address of the proxy rather than the IP 

35 address of the origin server. Thus, in order to determine 
whether it has the object in its cache and if not establish a 
connection to the origin server, the proxy requires the 
absolute URL of the origin server in the GET request. 
When requests are transparently directed to a proxy cache, 

40 however, the client browser is unaware that the request is 
being directed to the proxy and is possibly being fiilfiUed 
from the cache. Rather, the client's browser needs to "think" 
that it is connected to the origin server to which its SYN and 
the packet(s) containing the GET request are addressed. 

45 Such origin server IP address is determined by the browser 
through a. DNS look-up. Further, the source address of the 
ACK SYN packet and the packets containing the requested 
object must be that same origin server IP address or they will 
not be recognized by the browser as being the responsive 

50 packets to the SYN packet and the request for the object. 
Thus, in order to transparently send object requests to a 
proxy cache, a mechanism must be in place along the packet 
transmission path to intercept an initial SYN packet sent by 
a browser and to redirect it to the proxy cache to establish 

55 a TCP connection. The proxy cache must then masquerade 
as the origin server when sending the ACK SYN packet back 
to the client by using the origin server's IP address and port 
number as the source address of that packet. Further, the 
subsequent packet(s) containing a GET request must be 

60 redirected to the proxy cache and the request fulfilled either 
from the cache or via a separate TCP connection from the 
proxy to the origin server. In either case, the source address 
of packets sent back to the client must be the origin server's 
IP address and port number to which the padsets sent by the 

65 client are addressed. 

In order for packets associated with a request for an object 
to be redirected to a proxy cache connected somewhere in 
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the network, a Layer 4 (L4) switch on the packet path and then the packet(s) containing the GET request to the 
"looks" at the port ntunber of a destination address of a SYN origin server, the L4 switch again performs a NAT, trans- 
request packet. Since HTTP connection requests arc gener- lating the destination IP address to the IP address of the 
ally directed to port 80 of an origin server, the L4 switch proxy. Thus, when the packet(s) containing the GET request 
transparently redirects all packets having a port number of 5 is received by the proxy, it does not know the IP address of 
80 in the destination address. The SYN packet is thus sent the origin server as in the direcUy connected proxy arrange- 
to a selected proxy cache. In order for the proxy cache to ment described above. The proxy must therefore look at the 
properly respond to the client, as noted, it must know the logical name in the HOST field and perform a DNS look-up 
absolute URL of the requested object and packets returned to determine that site's IP address. The proxy then uses that 
to the cHent must masquerade as coming from the origin 10 IP address in combination with the complete URL in the 
server. Unlike the non-transparent caching method previ- GET request to form an absolute URL from which it 
ously described in which the browser formulates a GET determines whether it has the requested object in its cache, 
request with the absolute URL, for transparent caching the If it doesn't, a separate TCP connection is established from 
absolute URL must be provided in some manner to the proxy the proxy to that absolute URL to retrieve that object, which 
cache in order for the proxy to determine whether it in fact is is returned to the proxy. Whether the object is found in the 
has the requested object in its cache, or whether it must proxy cache or is retrieved over the separate connection 
establish a separate TCP connection to the origin server to from the origin server, it is forwarded back to the L4 switch 
request the object. In the prior art, when one or more caches where a NAT and PAT are performed to translate the 
are directly connected to the L4 switch, the switch dbooses destination address to that of the client and to translate the 
one of the caches and transparently forwards the packets to 20 source address to the particular origin server to which the 
that proxy without modifying the source or destination client's request was directed. It should be noted that the 
address of the packets. The proxy, working in a promiscuous source address of the origin server obtained when the 
TCP mode accepts all incoming packets regardless of their client's browser initiates a DNS look-up using the origin 
destination address. The proxy, then receiving the SYN server's absolute URL may not be the same IP address 
packet with the origin server's destination address and the 25 obtained when the proxy performs a DNS look-up using the 
cUent's source address, can respond to SYN packet with an combination of the site URL in the HOST field and the 
ACK SYN packet. This ACK SYN packet has the client's complete URL in the GET request, 
address as a destination address and a source address mas- The above described techniques for performing transpar- 
querading as the origin server address. This packet is trans- ent proxy caching have several disadvantages. Firstly, use of 
ported through the L4 switch onto the network over the TCP 30 a HOST field to specify a logical name of an origin server 
connection back to the client. The subsequent packet(s) with is not currently incorporated within the presently employed 
the GET request from the client is redirected by the L4 HTTPl.O stamlards. Thus, a HOST field may not be present 
switch to the directly connected proxy. Since the GET in the packet(s) containing a GET request. Where, as 
pack6t(s) only contains the complete URL, the proxy must described above, the information in the HOST field is 
formulate the absolute URL to determine whether its has the 35 necessary to form an absolute URL to determine whether the 
requested object in its cache or whether is must establish a proxy cache has the requested object and, if not, to establish 
separate TCP connection to the origin server. The proxy a connection to an origin server from the proxy, the absence 
forms the absolute URL by prefixing the complete URL in of the HOST field results in an unfilled request. Secondly, 
the GET request with the IP address of the origin server in the prior art techniques require the proxy cache to perform 
the destination address of the packet. The proxy can then 40 the function of forming an absolute URL from the informa- 
determine whether it has the object and, if not, establish a tion in the HOST field and in the packet(s) containing the 
TCP connection to that absolute address. If that particular GET request. Thus, standard proxy caches which expect the 
origin server at that IP address should be inoperative, the client's browser to produce the absolute URL cannot be 
proxy can alternatively prefix the complete URL in the GET used. A methodology for tran^arcnt proxy caching that is 
request with the logical name of the site indicated in the 45 transparent to both the client and the proxy is desirable to 
HOST field in the packet(s) containing the GET request. avoid modification to the program that controls proxy cache 
In the prior art, if the proxy cache is not direcdy connected operations. Standard proxy caches could thus be employed 
to the L4 switch, then the L4 switch must perform a network anywhere in the network without the need for a special 
address translation (NAT) and port address translation (PAT) implementation. 

on those packets directed to port 80 of an origin server. 50 The above described prior art techniques have even 

Specifically, when the L4 switch receives a SYN packet to fiirther disadvantages with respect to persistent connections 

initiate a TCP connection from a client to an origin server, defined by the HTTPl.l standards. As defined by these 

it translates the destination address of the packet from the IP standards, a persistent connection enables a client to send 

address and port number of the origin server to the IP plural GET requests over the same TCP connection once that 

address and port number of a selected proxy cache. Further, 55 connection has been established between two endpoints. 

the switch translates the source address of the packet from When a prior art transparent proxy cache is interposed on the 

the clients IP address and port number to its own IP address connection, a client may "think" it has established a persis- 

and a port number. When the proxy responds with an ACK tent connection to the specific origin server determined 

SYN packet, it therefore responds to the L4 switch where a through the DNS look-up. The connection in reality, 

NAT translates the destination IP address from the IP address 60 however, is transparently diverted by the L4 switch to a 

of the L4 switch to the IP address of the client, and translates proxy cache. The proxy cache, in response to a DNS look-up 

the source IP address from the IP address of the proxy to IP using the logical name in the HOST field, may be directed 

address of the origin server. A PAT also translates the port to an equivalent origin server at a different IP address, 

number in the destination address from that of the L4 switch Further, as each subsequent GET request is received by the 

to that of the client, and translates the port number in the 65 proxy from the client within the client's perceived persistent 

source address from that of the proxy to that of the origin connection, each responsive DNS look-up to the logical 

server (usually 80). When the client sends an ACK packet name may direct a connection to an even different IP address 
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of an equivalent origin server. As a result, the advantages of prefixed by the proxy redirector to form the absolute URL 

a transaction-oriented persistent connection in which a (whether the GET request is one or more segments long) is 

server is capable of maintaining state information through- less than or equal to the actual MSS that the proxy can 

out the connection, are lost. A methodology is desirable that receive. 

maintains persistence to the same origin server to which the 5 gj^p DESCRIPTION OF THE DRAWING 

clients browser is directed, or to a same equivalent ongm 

server throughout the duration of the persistent connection. FIG. 1 is a block diagram of a network that includes a 

proxy redirector that transparently sends requests from a 

SUMMARY OF THE INVENTION client to a proxy cache by changing the destination address 

The problems associated with the prior art techniques for lo of packets in a client request from that of the origin server 

transparent proxy caching are eliminated by the present to that of a proxy and the source address from that of the 

invention. In accordance with the present invention, a client to that of the switching entity and, in accordance with 

switching entity, such as the L4 switch (referred to herein- the present invention, modifies a GET request to include the 

after as a proxy redirector), through which the packets flow, destination address of the origin server; 

is provided with the functionalities at the IP level necessary 15 FIG. 2 is a block diagram showing the proxy redirector 

to transform the complete URL in each GET request trans- implemented on a programmable network clement that 

mitted by a client to an appropriate absolute URL. manipulates packets in accordance with instructions pro- 

Specifically, the IP address found in the destination field in vided by a loaded program; and 

the IP header of the packet(s) from the client containing the pjos 3^ 4^ 5 and 6 arc flow charts detailing the operation 

GET request are added as a prefix by the proxy redirector to 20 of the proxy redirector. 
the complete URL in the GET request. As a result, the 

complete URL in the GET request is modified to form an DETAILED DESCRIPTION 
absolute URL which, when received by the proxy cache, is With reference to FIG. 1, a plurality of clients 101- 
dircctly used to determine if the requested object is stored in 1-101-N arc connected to a local area network (LAN) 102, 
the cache and, if not, to establish a separate TCP connection 25 such as an Ethernet LAN 102, which, in turn, is connected 
to the origin server. The GET request received by the proxy through a router 103 to a Level 4 (L4) switch 104 (proxy 
is thus equivalent to what it would expect to receive if it redirector )_whic h- interfaces the LAN with a wide are a 
were operating in the non-transparent mode. network (WAN)- 105._such^as_tbeJ ntejiiet. Although shown 
Advantageously, if a persistent connection is established, as two separate elements, the functionafities of router 103 
each subsequent GET request has the same IP address prefix 30 a nd proxy redirector 104_can in actual p ractice be combined 
determined by the initial DNS look-up by the client in a single unit. All requests from any of the clients con- 
By modifying the GET request at the proxy redirector to nected to LAN 102 for objects stored in servers connected 
include the destination address of the origin server, the to the Internet 105 traverse g roxv redirector 104 onto the 
number of bytes at the IP level in the packet containing the Inter net. The packets comprisin g _sucb_icqucsts,^wliich 
resultant absolute address are increased by the number of 35 include the standardized packets that establish a TCP 
bytes in the prefix. Inchided in the header within each packet co nnecflon, are directed to an IP destination ad dress and port 
is a sequence number (seq) that provides an indication of the nu mber indicatedio the IP head^ oLeac h packet ori gmating 
position of the first byte number in the payload. Thus, when from_a_ dient source address that inclu des^cdisnt IP address 
the IP address is added to a packet, the sequence number of an d port numbe r. Similarly, responses to such requests from 
each of the subsequent padcets needs to be incremented by 40 an origin server connected to Internet 105 are directed via an 
the count of the added bytes. Further, an acknowledgement I Pdestination add ress that jseq ual to the client's IP address 
sequence number (ack_seq) in the header on the packets and port number from which the requ est originated, and 
retximed from the proxy or the origin server that logically have asa^source address the server's IP address and port 
follow receipt of the GET packet(s) at the origin server numbeTTAl Tsuch packets directed to any of the clients 
needs to be decremented by the proxy redirector before 45 101 -1-101-N fronLan y server connected to ktc met 105 pass 
being fiarwarded to the client to avoid confusing the client tfirough^roxyredirector 104. 

with reject to what the sequence number of the next byte When any of tfie clients cocmcctcd to LAN 102, such as 

it sends should be. Further, if the GET request sent by the client 101-1, makes a request through a browser for an 

client encompasses more than one TCP segment, then the object by specifying a logical URL, a domain name server 

extra bytes in the first of the segments caused by the 50 (DNS) 106 connected locally or on Internet 105, as shown, 

additional bytes added to the URL are shifted into the second is accessed to perform a database look-up based on that 

segment, and the resultant now extra bytes in the second logical name. An associated IP address is then returned to the 

segment are shifted into the third segment, etc., until the last browser. The IP address returned to the browser is the IP 

of the segments. In order to preclude the necessity of address of a particular origin server which contains the 5 

requiring an extra segment to be added to the GET request 55 objea requested through the logical URL. Since a logical 

to accommodate the extra bytes, the client sending the GET name may in fact be associated with a plurality of essentially 

request, is deceived into sending segments whose maximum equivalent origin servers, such as servers 1(^7 and 109, the 

size is less than what can actually be received by the proxy particular IP address returned to the client browser chosen 

as indicated by a maximum segment size (MSS) field in by DNS 106 may be determined in a round-robin manner, 

packets from the proxy. The proxy redirector, upon receipt 60 When DNS 106 selects an origin server corresponding to the 

of the On, ACK SYN packet from the proxy, reduces the logical URL, the IP address of the selected origin server, 

MSS parameter received from the proxy by the amount of such as, for example, the IP address of origin server 107, is 

the number of bytes that will be added to the GET request returned to the browser in the requesting client 101-1. That 

before that parameter is forwarded to the client Thus, when IP address then serves as the IP address to which packets 

the client next sends a GET request, each segment is limited 65 directed to the origin server from the client are directed, 

to the reduced MSS, thereby insuring that the segment size Conventionally, http requests are usually directed to port 80 

of a last segment in a GET request after the IP address is of an origin server. 
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With the IP address of the origin server determined and have been manipulated in accordance with the gateway 

returned to the client, the browser establishes a TCP con- program back to the kernel for output back to the network 

nection between the client and the origin server through a through filter 203 through network interfaces 208. Library 

three-way handshaking process. Specifically, a SYN packet, functions are provided in the programmable network ele- 

addressed to the IP address of the selected origin server, is 5 ment that enable a gateway program to communicate with 

sent by the client Handshaking is completed when the client the dispatcher process 202 to register rules that specify the 

receives an acknowledgement of receipt of that SYN packet type of IP packets that a gateway program wants diverted to 

through an ACK SYN packet sent by that origin server, and it. A gateway program can request either a complete IP 

responds with a ACK packet to the origin server. The packet or only the IP and TCP header of a packet and can 

browser then sends a GET request that specifies the particu- jg change b oth the header and payload of a pac ^t 
lar requested object. In the present invention, that programmable network 

In accordance with the present invention, once the IP element is operative in combination with a gateway program 

address of the origin server corresponding to the logical that manipulates the destination and soiu-ce addresses of 

URL name is determined through the DNS look-up, proxy packets flowing there through in a manner to be described, 

redirector 104, rather than establishing a TCP connection to 15 as well as modifying, as will be described, information in the 

the origin server at the determined IP address, traijsparently packet(s) containing the GET request that specifies the URL 

-^ablishes a TCP connection between the client and a proxy. of the requested object. Specifically, the programmable 

(jpOnt requested object is stored in the cache, a copy of that network element in combination with the gateway program 

object is transparently returned to the requesting client. A operates on packets associated with HTTP requests, which 
TCP connection, therefore, is not established over the Inter- 20 determined firom the destination port number. As previ- 

net 105, to the actual origin server 107 to provide the ously noted, HTTP requests are conventionally addressed to 

requested object to the requesting client. The ooss of trans- port 80 of an origin server. Thus, the programmable network 

milling the. request to the origin server over the Internet and element/gateway program which together comprise proxy 

transmitting the copy of the requested object back over the redirector 104 in this embodiment, captures through the 

Intemet are thereby saved in addition to the time required for 2s dispatcher process of the programmable network element, 

transmitting such a request over the Internet and waiting for packets directed to port 80 and then performs address 

a response firom the origin server. Illbe^roxy_cachejo translations on those captured packets to readdress these 

which-the-req uest i s directed does not contain the requested packets to a selected proxy. With respect to address 

ob^ct..a.separat e,TCP connection is established hetv^e en the translations, the gateway program translates the destination 

proxY_cache and the origin server to obtain a cop y of the 30 IP address of packets addressed to the origin server to the IP 

reque sted object. When fhe pmr y cache then rec eives the address of a selected proxy cache and translates the source 

cop y oT the requested object"fiSm an origin ser ver over that IP address of such packets from that of the client to the IP 

sep arate TCP c onnection, the copy is forwarded to the chent address of proxy redirector 104. Further, in order for proxy 

over me ongmul TCP connection that was established redirector 104 to identify requests from plural client termi- 

hetween th^Iient-and-the- proxy cach e. 35 nals that are directed to the same proxy, the source port 

In the embodiment shown in FIG. 1, a proxy cache 110-1 number is translated to a bogus ghost port number at the 

is illustratively shown connected to a LAN 111, which is proxy redirector. Thus, when prox y cache responds, t he 

connected to LAN 102 through a router 112. Another proxy packets^tr ansmitted by the cacEniave_a destination ^ IP 

cache 115 is shown connected on a different LAN 116 addressof prox y redirector 104 at tb at.bogus port num ber, 

through router 103. Other proxy caches can be located 40 wEichlsdistinctly associated with the client The gateway 

anywhere on LANs 102, lU, or U6, on another LAN program within proxy redirector 104 then translates the IP 

connected to the Internet 105 such as proxy cache 117. destination address of these responsive packets from the 

Proxy redirector 104 selects one of the available proxy proxy to the IP address of the client and translates the bogus 

caches to which to forward client requests based on a metric destination port number to the port number from which the 

f such as least-loaded or round-robin, haseH nn TP heflde r 45 client originated its request. Further, the gateway program 

U i nformation such as the origin server IP address. With translates the source IP address of such responsive packets 

I rcs pect_to the latter^ all objects firom a specific ori pin server firom that of the proxy to the IP address of the origin server 

f willl)e served by a ^ecific proxy. and the port number to the port (80) to which the client's 

In the preferred embodiment described herein, proxy requests were originally directed. Thus, the packeU which 

redirector 104 includes a programmable network element of 50 are returned to the client from the proxy masquerade as if 

the type described in co-pending U.S. patent application Ser. they had originated from the origin server to which the client 

No. 09/190,355, filed Nov. 12, 1998, which appKcation is "believed" its request had been sent, 
incorporated herein by reference. As described in that By performing the above-described network address 

application, that programmable network element in the translations (NATs) and port address translations (PATs), 

preferred embodiment runs on a Linux machine. As shown 55 packets from a client 101-1 are transparendy directed by 

in FIG. 2, the programmable network clement 200 includes prox y redir e.ctor_lQ4-to-a, prox y cache. ResponsivjELpackets 

a dispatcher process 202 with which plural differentgateway from the proxy cache are sent to proxy redirector 104 where 

progr,anas,(204, 205 and 206) register and request acS^ss to they are redirected to client 101-1. 
iP_p_adcets_thal_fi.tjp^cificjdescription5, Such programs are In ^establishing. a-TCP __connect ioD_that_is directed to an 
loaded throu gh an admission daem on. 210 via a local pro- 60 ori^n_server, ,client-101-l_first transmits a SYN packet, 

gram injector 211 or a remote program injector 212. A wtii ch is interce pjed_bv_pK)xv,redirector_104.-Eroxy.^redi- 

g atewav program, for e xamplfi^ can request access to inco m- rector.l0>4- se lects a proxv^ache,^uch,as,proxv_110-l,Jo 

ing packets to network interface 2 08 that match certain r edirect this request and creates aj:o nnectio n control blo ck 

source and destination lP address ranges and port num bers. (CCB) to.maintain.information.ab out the connection, Selec- 
flbe dispatcher process 202 uses a pac ket filter 203 in th e 65 ti on of the particular, prox y is determine d, as de scribed 

Linux kernel 201 to obtain packets requested by the gatew ay I bove, by one of several jMj ss ible al go"'*^^"'*^ Thr CCB is 

programs and tises a raw IP socket 215 to send packe'tsthat used to store tfie^lient'IPTddrcss and TCP port number and 
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the origin server IP address and TCP port number, both of 
which are contained in the IP header of the SYN packet, and 
the chosen proxy's IP address. The destination address is 
then changed to that of the chosen proxy and the packet is 
sent back to the network for redirection to its new destina- 
tion address of the proxy 110-1. All subsequent packets that 
originate from the same client with the same TCP port 
number are then forwarded to the same proxy. Proxy 110-1 
responds with an ACK SYN packet which is directed via its 
destination address to proxy redirector 110-1. Proxy redi- 
rector 104 then translates the source IP address and port 
number to those of the origin server and the destination IP 
address and port number to those of the client. When the 
packet arrives at the client the client believes that it is 
connected to the origin server. Hie client then responds with 
an ACK packet to the origin server, which is redirected by 
proxy redirector 104 to proxy cache 110-1, to complete the 
handshaking process. 

After the TCP connection is established between cUent 
101-1 and proxy cache 110-1, cHent 101-1 sends one or 
more packets containing a GET request addressed to the 
origin server. Such packets arc thus "captured" by proxy 
redirector 104 and redirected to proxy cache 110-1. As 
previously discussed, the GET request sent by the client 
contains only the complete URL sent by the client browser 
which in itself provides insufiScient information for the 
proxy cache to determine whether it has the requested object 
and, if not, to forward it to the origin serve ,r which does. In 
accordance with the present invention, the gateway program 
that is operative with the programmable network element of 
the proxy redirector 104, captures this padcet or packets and, 
in addition to previously described address translations, 
transforms the complete URL to an absolute URL by pre- 
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fixing it with the IP address of the origin server obtained 
from the destination IP address of the packet(s) containing 
the GET request. Thus, Level 7 (application) inJformation is 
modified to assist in level 4 routing. 

5 In order to make the URL transformation transparent to 
both the client and proxy cache endpoints, changes in IP and 
TCP headers are also required. Since the GET request 
modification increases the length of the IP packet that carries 
the GET request, the total length field on the IP header of this 

10 packet is increased by an offiscL The offset amount is 
recorded in the CCB. In addition, the TCP header contains 
sequence numbers (seq) and acknowledgement sequence 
numbers (ack_seq) that need to be translated. The seq in the 
TCP header indicates the byte number of the first byte on this 

15 packet going from the sender to the receiver over the TCP 
session and the acle_seq indicates the byte number of the 
next byte that the sender expects to receive fi-om the 
receiver. For all packets after the GETpacket(s) that go from 
the client to the proxy cache, the seq is increased by an o£fset 

20 equal to the lenglhof(absolute URL)-lengthof(complet6 
URL) so that the seq matches the byte number of the byte 
that the proxy cache expects to receive from the client. 
Similarly, on all packets starting with the acknowledgement 
to the GET packet that go from the proxy cache to the client 

25 through the proxy redirector, the ack^seq is decreased by 
the same ofket so that the ack_seq matches the byte number 
of the byte that the proxy cache would expect the client to 
send in the next packet following the GET packet. By 
performing these changes in the header, the client and proxy 

30 cache endpoints remain unaware of the modification in the 
GET packets from the complete URL to the absolute URL. 

Table 1 illustrates the URL and other header transforma- 
tions performed 



TABLE 1 



Arriving packet: 

— > beginning of packet header dump <~ 

— > IP header: version-4 hdr_len-5 TOS-0 pkt_len-346 id-60276 

frag_ofiMOOOH TTL«»64 protcicol«6 cksum=1792H 

saddr-135.104.25.243 daddr-204.71 .200.244 
™> TCP header sport=1273 dport=«0 seq=21S9084427 ack_seq=3266449517 

tcp_hdr_Jca=5 flag5=ACK PSH 

resl-OH re62-0H window^l856 ck«ij»-162H uigent-0 
— > beginning of packet data dump < — 
GET ^/ya/yahoomaii/promol.gif HTTP/ID 
Referer: ht^://www.yahoo.coin/ 
Connection: Kccp-Alivc 

User-Agent: Mozilla/4.05 [en] pCll; U; Linux 2.1.103 i6S6) 
Host: us,yimg.com 

Accept: imagc/gif, iniage/x-:d)itmap, imagp/jpcg image/pjpeg image/png 
Accept-Language: en 
Acccpt-Charsct: i5o-8359-l, *,utf 8 
Modified to: 



— > beginning of packet header dump < — 

— > IP header: ver8ioD-4 hdr_Jen-5 TOS-0 pkt_len-367 id-60276 

&ag_DEf=4000H TTL«64 protocol»6 cksum=1792H 

saddr-135.104.25.245 daddr-13 5.104.25.31 
— > TCP header: sport^SOOO dpoit-7000 seq»2189084427 ackL_seq=3266449517 

tcp_hdr_len=6 flag3=ACK PSH 

resl-OH res2-0H window-31856 cksum-162H uigent-0 
— > beginning of packet data dump < — 
GET http://204.71.200 :244/ft/ya/yahoomail/proniol.gif HTTP/l.O 
Referer: http://www.yahoo.com/ 
Connection: Kccp-Alive 

User-Agent: Mozilla/4.05 [en] pCll; U; Linux 2.1.103 i686) 
Host: its.yimg.com 
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Accept: image/gif, image/x-xbitmap, imagc^peg, imaged jpeg imagc^ng 
Accept-Language: en 
Accept-Charset iso-8859-1, *,utf 8 



on a GET packet that arrives at proxy redirector 104 from a 
client 101-1. The packet is destined to an origin server at a 
destination IP address (daddr) 204.71,200.244 
(www.yahoo.com) at a destination port (dport) 80, request- 
ing object /a/ya/yahoomail/promol.gif. As can be noted in 
the modified packet, the packet is redirected to a proxy cache 
110-1 at IP address 135.104.25.31 port 7000 by changing the 
daddr and dport header information. Also, the complete URL 
of the object in the GET request is modified in the translated 
packet by prefixing it with hltp://204.7 1.200.244 to form the 
absolute URL, where that prefix is obtained from the daddr 
header in the arriving packet. This transformation increases 
the length of the packet by 21 bytes so that the pkt„lcn field 
in the header is modified from 346 to 367 bytes. Further, the 
source port is modified to a bogus port number by changing 
sport to 5000- 

Table 2 shows the translations performed by the proxy 
redirector 104 to an acknowledgment from proxy cache 
110-1 to the GET request. The arriving 



length of the GET packet by 21 bytes, proxy redirector 104 
decreases the ack_seq field by the number of bytes added, 
21. Further, proxy redirector 104 translates the destination IP 
address and port number to those of the client 101-1, and the 
source IP address and port number to that of the origin 
server. The modified packet, when received by the client, 
thus appears to the cHent to have originated from the origin 
server and the ack_seq field indicates a byte number that the 
client would next expect to send having previously sent a 
packet of length 367 bytes. All packets that are subsequently 
sent through the proxy redirector 104 to client 101-1 from 
proxy cache 110-1 are similarly modified to decrement the 
ack__3eq field by the number of bytes, 21, added to the GET 
packet. 

Table 3 illustrates a next packet sent by client 101-1 to the 
origin server after the GET packet. In this packet the 
sequence number (seq) is equal to the modified ack_seq 
sent to the client, as per Table 2. The destination IP address 
and port mmiber are those of the origin server and the source 



TABLE 2 



Arriving packet: 

— > begiimmg of packet header dump < — 

— > IP header: vcrsion»4 hdr_Jeit»5 TDS=0 pkt_len=40 id=il4559 

frag_off-4000H TTL^ISS piotocoI-6 cksum-lOeH 

saddr=135.104.25.31 daddr«135.104.25.245 
— > TCP header: spoit-7000 dport=5000 seq=3266449517 ad£_scq=.2 189084754 

tcp_hdr_Jen-5 fiag^ACK 

rcsl=OH rcs2=*H wiiidow«8433 cksuin«32H urgent^ 
Modified to: 

— > begjnning of packet header dump < — 

— > IP header: veision«*4 hdr_Jen=5 TOS=0 pkt_Jen=40 id=14559 

frag__off-4000H TTL^TSS protocol-6 cksum-lOeH 

saddr^204.71.20a244 daddn.135. 104.25. 243 
~> TCP header: spOTt»80 dportol273 seq=32 664495 17 ack_seq=2189084733 

tcp_hdr_Jett-5 fiags-ACK 

rcsl«»OH rcs2=0H window^33 cksum=32H urgcntsO 



packet is addressed (daddr) to the IP address of the proxy 
redirector at the bogus port (dport) 5000 used by the proxy 
redirector for this TCP connection. The source IP address 
(saddr) and the source port (sport) are those of the proxy 
cache to where the GET request was directed. The ack^scq 
field indicates the byte number of the first byte that is 
oqjected to be sent in the packet following the GET packet. 
In this example, ack_seq is eqiial to the sequence number 
(seq) 218908427 of the GET packet plus the length of the 
GET packet, which in this case per Table 1 is 367. Thus 
ack_seq of the arriving packet is 218908754. Since cHent 
101-1 is unaware that a proxy redirector has increased the 



IP address and port number are those of the client. When 
received by proxy redirector 104, the packet is modified to 
change the source IP address and port number to the IP 
address and bogus port number of the proxy redirector. The 
destination address IP and port number are translated to that 
of proxy cache 110-1. The sequence number (seq) is 
increased by that same value of 21 to match the byte number 
that the proxy cache expects to receive based on the 
sequence number previously received in the GET packet and 
the length, 367, of the GET packet 



50 



55 



TABLE 3 



Arriving packet' 

— > begyming of packet header dump < — 

—> IP header: versioD=4 hdr_Jena5 TOS=0 ptt_len«40 id=60281 
&ag_off-4000H TTL-64 protocol-6 cksum-lSbfH 
saddr«-135.104.25.243 daddr=204.7a .200.244 
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— > TCP header: sport=1273 dport=80 scq=2l89084733 ack_scq=3266450977 
tq>_hdr_Jeno5 flags=iACK 

resl-OH resl-OH window-31856 cksum-d3f7H urgent-0 
Modified to: 

— > beginning of packet header dump < — 

— > IP header: vcision»4 hdr_lcii=5 TOS«0 pkt_lcD=40 id=60281 

frag_off=4000H TrL=64 protocol=>6 cksum=18bfH 

saddr-135.104^.245 daddp-135a 04.25.31 
— > TCP header: 8port=5000 dport=7000 $eq»21 89084754 ack_5eq=3266450977 

tq3_hdr_len=5 flags=ACK 

resl-OH res2-0H window-31856 cksum-d3f7H urgent-0 



received. All subsequent packets directed to the origin server 
from client 101-1 are similarly modified before being 
directed to proxy cache UO-1. 

In the above-description, it has been assumed that the 
length of the GET request both before modification, and 
after the URL extension is less than the maximum TCP 
segment size. In fact, the length of the GET request may be 
longer than one TCP segment. If the length of the GET 
request carrying the complete URL occupies x number of 
TCP segment and, after it is modified to carry the absolute 
URL, it still also fits into that same x number of TCP 25 
segments, then the segment carrying the URL is modified 
and overflowing characters are pipelined from one segment 
to the next. Thus, the overflowing characters fi-om a previous 
packet are prefixed to the start of the next packet, etc., until 
the last packet, which length is increased by the increased 30 
number of bytes due to the URL modification. Therefore, the 
packet lengfli of only the last segment is modified to include 
the characters that have been shifted into that segment. The 
ack_seq parameter in packets from the proxy cache to the 
chent is modified starting from the acknowledgment to the 35 
last GET packet. 

If the modification of the URL to the absolute URL could 
cause the last TCP segment of the GET request to overflow 
to another segment, a new TCP segment would need to be 
constructed and injected by the proxy redirector. The proxy 40 
redirector would then need to have the capability to retrans- 
mit this segment if it was lost. Thus, the proxy redirector 
would need to have some TCP layer functionalities. In order 
to avoid adding higher level fiinctionality to the proxy 
redirector, segment sizes are limited to less than what the 45 
proxy cache is actually capable of receiving. When the 
complete URL is transformed to an extended URL, the 
maximimi increase in size is 22 bytes, equal to the maximum 
length of an IP address of 15 bytes plus 7 bytes from the 
prefix: http://. The client is deceived to send segments whose 50 
maximum size is 22 bytes less than what the protocol allows 
it to send. The TCP segment size sent by the client is 
determined by what the proxy cache, in its handshake with 
the client, indicates as the maximum segment size it can 
receive. This is indicates by the proxy cache through the 55 
maximimi segment size (MSS) field in the ACK SYN 
packet. Accordingly, the proxy redirector 104 intercepts the 
ACK SYN packets and decreases the specified MSS amoimt 
by 22. For example, if the MSS specified by the proxy cache 
is 1460, it is modified to 1438 by the proxy redirector before 60 
being sent to the cUent When the chent next sends a GET 
request, the TCP segments are Umited to 1438 bytes. In the 
worst case, when the chent sends a GET request, 22 bytes 
will be added to the xth TCP segment that carries this 
request. The length of this xth TCP segment will still then be 65 
within the maximum length specified by the proxy cache. If 
the event that the proxy cache does not stipulate a maximimi 



MSS in the ACK SYN packet, the default used by the cHent 
is 536 bytes. An MSS option is then added by the proxy 
redirector to inform the client that the maxiinum MSS 
expected by the other end of the TCP connection^is^4 
bytes. 

As previously described, a NAT and PAT are performed 
by proxy redirector 104 on all packets addressed by cHent 
101-1 to an origin server, and all packets addressed by proxy 
cache 110-1 to proxy redirector 104 for return to the client. 
Proxy redirector 104 thus performs a NAT and a PAT on 
these packets flowing in both direction. If proxy redirector 
104 selects a proxy cache that is located in such a pomt on 
the network that packets from the proxy cache addressed 
directly to client 101-1 must pass through proxy redirector 
104 due to the network configuration, then proxy redirector 
need only perform a half-NAT on the packets flowing 
through it. SpedficaUy, if proxy redirector 104 selects a 
proxy cache such as proxy cache 117, all packets addressed 
to client 101-1 must pass through proxy redirector 104. 
Proxy redirector 104 thus only needs to transform the 
destination IP address and port number of packets from 
cHent 101-1 to the IP address and port number of proxy 
cache 117, while maintaining the source IP address and port 
number as those of client 101-1. The packets retiuned from 
proxy cadie 117 will thus be addressed to the chent' s IP 
address and port number. When they pass through proxy 
redirector 104, they are captured and the transformation of 
the source IP address and port to those of the origin server 
are the only address changes that need to be effected. 

The problems of the prior art with respect to persistent 
connections is obviated in accordance with the present 
invention. As previously noted, during a persistent connec- 
tion plural GET requests can be made by a client. In the prior 
art, as described, each GET request can result in a connec- 
tion from a proxy cache to a different origin server if the 
proxy cache does not have the requested objects. The ability 
of a server to maintain the stale of a client's connection 
throughout the duration of the connection is compromised if 
each GET request results in connections to multiple servers. 
In accordance with the present invention, once the IP address 
of the origin server is determined at the initial DNS lookup, 
that same IP address is used by the proxy redirector as a 
prefix to each complete URL in every GET request issued by 
the client throughout the duration of the persistent connec- 
tion. Thus, assuming the proxy cache does not contain any 
of the requested objects, the proxy cache will establish a 
TCP connection to the same origin server in response to each 
GET request generated by the client. It should be noted that 
if plural client GET requests are forwarded by the proxy 
redirector to a proxy cache within a persistent TCP 
connection, ack__seq parameter in packets that flow through 
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the proxy redirector from the proxy following each GET FIG. 6 illustrates the step performed by the proxy nedi- 

request must reflect the cumulative changes effected by rector for each packet destined for the origin server firom the 

translating the complete URL to the absolute URL in each of client that follows the GET request. At step 601, a packet 

the preceding GET requests within the same TCP connec- subsequent to the GET request is received from the client. At 

tion. Similarly, in all packets received by the proxy redirec- 5 step 602, a full NAT and PAT are performed. At step 603, 

tor from the client directed to the origin server within a seq_oo is increased by the amount of bytes added by 

persistent TCP connection, the scq parameter must reflect modifying the previous GET request. At step 604, the packet 

cumulative changes. is sent to the proxy. 

FIGS. 3, and 4, together are flow charts detailing the In the discussion above of FIGS. 3, 4, 5 and 6, it has been 

functions of proxy redirector 104 in establishing a TCP 10 assumed that the proxy cache is located in a position such 

connection to a proxy cache and modifying the GET request that packets directed to the client will not automatically flow 

so that such requests can be transparently forwarded to the through the proxy redirector. Thus, all packets &om the 

proxy. At step 301, a SYN packet arrives from the client at proxy are addressed to the proxy redirector. Therefore, for 

the proxy redirector At step 302, proxy redirector selects a packets flowing from the client, and packets flowing from 

proxy cache based on a load balancing algorithm or on an 15 the proxy, the proxy redirector performs a full NAT and PAT. 

arbitrary or random selection. At step 303, proxy redirector If however, as previously described, the proxy cache 

performs a full NAT, changing the daddr from that of the selected by the proxy redirector is located on the network so 

origin server to that of the selected proxy and saddr from that that all packets from proxy to the client automatically flow 

of tbe client to that of the proxy redirector. At step 304 a PAT through the proxy redirector, then, in the steps shown in 

is performed, changing sport to that of a bogus ghost port 20 FIGS. 3, 4, 5 and 6, only a half NAT needs to be performed, 

number and dport to the proxy's port number. At step 305, Although described in conjunction the programmable 

the SYN packet is sent to the proxy. In response to that SYN network element shown in FIG. 2, the proxy redirector of the 

packet, the proxy re^onds, at step 306, with a SYN ACK present invention could be implemented through other 

packet containing an MSS parameter in the TCP header. At means, using hardware, software, or a combination of both, 

step 307, a reverse translation is performed on both the IP 25 As an example, a level 4 switch having a fixed program to 

addresses and port numbers, changing saddr and sport to perform the required packet manipulations required by the 

those of the origin server and daddr and dport to those of the present invention could be used. 

client. At step 308, the MSS field is changed by reducing the As described, the proxy cache returns requestedobjects to 

value of the MSS received from the proxy by 22. At step the address fro m_which.a req uest orjgjnated.as.iodicated by 

309, the ACK SYN packet is sent to the client. At step 310, 30 the saddr an d sport-parametersin the IP.header-ioformation. 

proxy redirector receives a responsive ACK packet from the wJiich Js_lhe-address-of .the„pTOxy_iedirector^04^ the 

client. At step 311, a fiill NAT and PAT are performed on that proxy_cache_is_nQt,conDect cd on the network so that _all 

packet and, at step 312, tbe modified packet is sent to the r esponses do not autQ maticallv_i>ass_thrQU£h_the proxy 

proxy, thereby completing the handshake sequence. r edirector. The interactions b etween_the,requesting client 

At step 313, a packet containing a GET request is received 35 and,the.proxY_cache.ai e_ trans parent to. both the c ljent and the 

from the client. A full NAT is performed at step 314 and a proxyxachs. sincc-the.clicnt does not "know" that its request 

PAT is performed at step 315. A determination is made at is being redirected to the proxy by the proxy redirector, and 

decision step 316 whether this is a first packet in the GET the proxy cache, when receiving a GET request with an 

request. If yes, at step 317, the IP address of the origin server absolute URL does not know that that absolute URL is not 

obtained Is from daddr of the arriving packet is prefixed to 40 being formulated by the cUent's browser operating in a 

the complete URL in the GET request. If, at step 316, the non-transparent mode. Advantageously, the proxy cache 

packet is not a first packet in a GET request, then, at step requires no software modifications and standard proxy 

318, the overflow bytes from the previous GET padcet are caches, connected anywhere on the network can be used in 

prefixed to those bytes in the current packet and if the total conjunction with the proxy redirector. If, however, the proxy 

nimiber of byles in the resultant packet is than the actual 45 is modified, using a programmable network element as 

MSS sent by the proxy, the overflow bytes greater are previously described, for example, the requested object 

buffered for prefixing to the next packet. After alternative retrieved by the proxy from its own cache or received from 

steps 316 or 318, at step 319, a determination is made an origin server, can be sent directly back to the client, 

whether the current packet is the last packet of a GET thereby obviating the need to send such packets back to the 

request. If not, at step 320, the current packet is sent to the 50 proxy redirector for address translations and redirection to 

proxy and the flow returns to step 313 to receive the next the client. By performing only a half -NAT at the proxy 

packet in the GET request from the client. If at step 319, the redirector and leaving the client's saddr and sport as the 

current packet is the last packet in a GET message, then, at source IP address and port number in the header of the SYN 

step 321, the pkt_len parameter of that packet is changed to packet, GET request packet(sX and other packets forwarded 

reflect the change in length of the packet. At step 322, the 55 by proxy redirector 104 from the client, the proxy cache can 

modified packet is sent to the proxy. return padcets responsive to the request directly to the cHent 

FIG. 5 illustrates the steps performed by the proxy redi- by substituting the origin server's IP address and port 

rector for each padcet received from the proxy starting from number as the source address for its own address. If the 

the ACK to the GET request through the end of the con- proxy redirector performs a full NAT and PAT, then another 

nection. At step 501, the proxy redirector receives the ACK 60 mechanism must be incorporated to provide the client 

to the GET request, or any other packet that logically follows address to the proxy cache, such as incorporating the client 

the ACK to the GET request. At step 502, reverse NAT and address information as part of an appendix to the absolute 

PATs are performed, translating daddr and dport to those of address in the modified GET request and stripping the 

the client and saddr and sport to those of tte origin server. appended client address information at the proxy before 

At step 503, ack_seq is decreased by the amount added in 65 determining whether the requested object is stored in the 

the preceding GET request. At step 504, the modified packet cache or whether a connection to the origin server need be 

is sent to the client. made. Advantageously, by sending the packets from the 
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proxy cache directly back to the client, the delay encoun- 
tered by transmitting such packets back to the proxy redi- 
rector for address translation and redirection is eliminated. 
Disadvantageously, the proxy cache must be modified to 
perform these functions, precluding use of standard avail- 
able proxy caches. 

Although described hereinabove in connection with GET 
requests, the present invention can equally be applied to 
redirection of any type of request message in which the 
token is, for example, GET, POST or HEAD, or any other 
type of token yet to be implemented and/or standardized. 

The foregoing therefore merely tUuslrates the principles 
of the invention. It will thus be appreciated that those skilled 
in the art will be able to devise various arrangements which, 
although not explicitly described or shown herein, embody 
the principles of the invention and arc included within its 
spirit and scope. Furthermore, all examples and conditional 
language recited hereinabove are principally intended 
expressly to be only for pedagogical purposes to aid the 
reader in understanding the principles of the invention and 
the concepts contributed by the inventors to furthering the 
art, and are to be construed as being without limitation to 
such specifically recited examples and conditions. 
Moreover, all statements hereinabove reciting principles, 
aspects, and embodiments of the invention, as well as 
specific examples thereof, are intended to encompass both 
structural and functional equivalents thereof. Additionally, it 
is intended that such equivalents include both currently 
known equivalents as well as equivalents developed in the 
future, i.e., any elements developed that perform the same 
function, regardless of structure. 

Thus, for example, it will be appreciated by those skilled 
in the art that the block diagrams and flowcharts described 
hereinabove represent conceptual views of illustrative cir- 
cuitry and processes embodying the principles of the inven- 
tion. Similarly, it will be appreciated that any flowcharts, 
flow diagrams, state transition diagrams, pseudocode, and 
the like represent various processes which may be substan- 
tially represented in computer readable medium and so 
executed by a computer or processor, whether or not such a 
computer or processor is explicitly shown. 

The functions of the various elements shown in the FIGS., 
including functional blocks labeled as "processors'* may be 
provided through the use of dedicated hardware as well as 
hardware capable of executing software in association with 
appropriate software. When provided by a processor, the 
functions may be provided by a single dedicated processor, 
by a single shared processor, or by a plurality of individual 
processors, some of which may be shared. Moreover, 
explicit use of the term "processor"' or "controller" should 
not be construed to refer exclusively to hardware capable of 
executing software, and may implicitly include, without 
limitation, digital signal processor (DSP) hardware, read- 
only memory (ROM) for storing software, random access 
memory (RAM), and non- volatile storage. Other hardware, 
conventional and/or custom, may also be included. 
Similarly, any switches shown in the FIGS, are conceptual 
only. Their function may be carried out through the opera- 
tion of program logic, through dedicated logic, through the 
interaction of program control and dedicated logic, or even 
manually, the particular technique being selectable by the 
implementor as more specifically understood from the con- 
text. 

In the claims hereof any element expressed as a means for 
performing a specified fiinction is intended to encompass 
any way of performing that function including, for example, 
a) a combination of circuit elements which performs that 
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fiinaion or b) software in any form, including, therefore, 
firmware, microcode or the like, combined with appropriate 
circuitry for executing that software to perform the function. 
The invention as defined by such claims resides in the fact 

5 that the functionalities provided by the various recited 
means are combined and brought together in the maimer 
which the claims call for. Applicant thus regards any means 
which can provide those functionalities as eqmvalent to 
those shown hereinabove. 

10 What is claimed is: 

1. A method at a Layer 4 switch for redirecting an HTTP 
connection request from a client that is directed to an origin 
server to a proxy cache over a packet-based computer 
network comprising the steps of: 

15 receiving at least one packet containing a request message 
within the HTTP connection, the at least one padcet 
having an IP header comprising a destination address of 
an origin server, the request message including a com- 
plete address of a specified object at the origin server; 
modifying, at the packet level, the request message by 
combining the destination address of the origin server 
with the complete address; and 
forwarding the at least one packet containing the modified 
request message to the proxy cache over the packet- 
based network; 
wherein the proxy cache is a standard proxy cache and, in 
the forwarding step, the at least one packet containing 
the modified request is redirected to the standard proxy 
cache transparently from the standpoints of both the 
client and the proxy cache, the proxy cache being able 
to determine whether it can satisfy the request for the 
specified object by using the combined destination 
address of the origin server and the complete address 
contained in the modified request message. 

2. The method of claim 1 wherein the step of modifying 
comprises the step of prefixing the complete address in the 
request message with the destination address of the origin 
server. 

^ 3. The method of claim 2 further comprising the step of: 
translating the destination address in an IP header of the 
at least one packet containing the modified request 
message to the address of the proxy cache. 

4. The method of claim 3 further comprising the step of: 
45 translating a source address in the IP header of the at least 

one packet containing the modified request message 
from an address of the client to an address of the Layer 
4 switch. 

5. The method of claim 3 further comprising the step of: 
50 in a TCP header in packets received from the proxy cache 

conamencing with an acknowledgment to the modified 
request message, modifying a value in an acknowl- 
edged byte sequence number field to reflect the 
increased number of bytes in the modified request 
55 message resulting from the step of prefixing the com- 
plete address with the destination address of the origin 
server. 

6. The method of claim 5 wherein the step of modifying 
the value in the acknowledged byte sequence number field 

60 comprises the step of decreasing the value in the acknowl- 
edged byte sequence number field by the number of bytes 
added by prefixing the destination address of the origin 
server to the complete address in the request message. 

7. The method of claim 6 further comprising the step of: 
65 translating in the IP header the destination address to the 

address of the chent and the source address to that of 
the origin server in the packets received from the proxy 
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cache commeocing with the acknowledgment to the 
request message. 

8. The method of claim 3 further comprising the step of: 
modifying a value in a sent byte sequence number field to 

reflect the increased number of bytes in the modified 
request message resulting from prefixing the complete 
address with the destination address of the origin server 
in a TCP header in packets received from the client 
following receipt of the request message. 

9. The method of claim 8 wherein the step of modifying 
the value in the seot byte sequence number field comprises 
the step of increasing the value in the sent byte sequence 
number field by the number of bytes added by prefixing the 
destination address of the origin server to the complete 
address in the request message. 

10. The method of daim 9 further comprising the step of: 
translating the destination address to the address of the 

proxy cache in the IP header in each packet received 
fi-om the client following receipt of the request mes- 
sage. 

U. The method of claim 2 further comprising, prior to the 
step of receiving the at least one packet containing the 
request message, the steps of: 

receiving a packet from the proxy cache indicating a 
maximum segment size that packets sent to it thereajfter 
should have; 

reducing by a predetermined number the maximum seg- 
ment size of packets to be thereafter sent to the proxy 
cache; and 

sending a packet to the client indicating the reduced 
maximum segment size that packets thereafter sent by 
the client to the origin server should have. 

12. Tbe method of claim 11 wherein the predetermined 
number is equal at least to the maximum number of bytes 
added by the step of prefixing the destination address to the 
complete address in the request message. 

13. The method of claim 12 further comprising the steps 

of: 

successively shifting to a next packet in the request 
message overflow bites caused by the step of prefixing 
the destination address to the complete address in the 
request message, if the request message is contained 
within more than one packet; and 

changing a length-of-packet parameter in the IP header in 
a last packet of a multi-packet request message to 
reflect the bytes shifted into the last packet from a 
previous packet. 

14. The method of claim 12 further comprising the step of 
changing a length-of-packet parameter in the IP header to 
reflect the change in the length of that packet caused by the 50 
step of prefixing the destination address to the complete 
address in the request message, if the request message is 
contained within one packet. 

15. The method of claim 1 wherein the request message 
is a GET request. 

16. The method of claim of claim 1 further comprising the 
step of selecting the proxy cache &om a plurality of different 
proxy caches prior to receiving the at least one packet 
containing the request message. 

17. The method of claim 1 wherein the request message 
is a POST request. 

18. The method of claim 1 wherein the request message 
is a HEAD request. 

19. A proxy redirector for redirecting an HTTP connection 
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means for receiving at least one packet containing the 
request message, the a least one packet having an IP 
header comprising a destination address of an origin 
server, the request message including a complete 
address of a specified object at the origin server; 

means for modified the request message by combining the 
destination address of the origin server with the com- 
plete address at the packet level; and 

means for forwarding the at least one packet containing 
the modified request message to the proxy cache over 
the packet-based network; 

wherein the proxy cache is a standard proxy cache and the 
means for forwarding redirects the at least one packet 
containing the modified request to the standard proxy 
cache transparently &om the standpoints of both the 
client and the proxy cache, the proxy cache being able 
to determine whether it can satisfy the request for the 
specified objea by using the combined destination 
address of the origin server and the complete address 
contained in the modified request message. 

20. The proxy redirector of daim 19 wherein the means 
for modifying comprises means for prefixing the complete 
address in the request message with the destination address 
of the origin server. 

21. The proxy redirector of claim 20 further comprising: 
means for translating the destination address in an IP 

header of the at least one packet containing the modi- 
fied request message to the address of the proxy cache. 

22. The proxy redirector of claim 21 further comprising: 
means for translating a source address in the IP header of 

the at least one packet containing the modified request 
message ftom an address of the client to an address of 
the proxy redirector. 

23. The proxy redirector of claim 21 further comprising: 
means for modifying a value in an acknowledged byte 

sequence number field in a TCP header in packets 
received from the proxy cache, commencing with an 
acknowledgment to the modified request message, to 
reflect the increased number of bytes in the modified 
request message resulting from prefixing the complete 
address with the destination address of the origin 
server. 

24. The proxy redirector of daim 23 wherein the means 
for modifying the value in the acknowledged byte sequence 
number field decreases the value in the adaiowledged byte 
sequence number field by the number of bytes added by 
prefixing the destination address of the origin server to the 
complete address in the request message. 

25. The proxy redirector of claim 24 further comprising: 
means for translating in the IP header the destination 

address to the address of the client and the source 
address to that of the origin server in the packets 
received from the proxy cache commendng with the 
acknowledgment to the request message. 

26. The proxy redirector of claim 21 further comprising: 
means for modifying a value in a sent byte sequence 

number field to reflect the increased number of bytes in 
the modified request message resulting from prefixing 
the complete address with the destination address of the 
origin server in a TCP header in packets received from 
the client following receipt of the request message. 

27. The proxy redirector of claim 26 wherein the means 
for modifying the value in the sent byte sequence number 
field increases the value in the sent byte sequence number 



request from a client that is directed to an origin server to a 65 field by the number of bytes added by prefixing the desti- 
proxy cache over a packet-based computer network com- nation address of the origin server to the complete address 
prising: in the request message. 
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28. The proxy redirector of claim 27 further comprising: the modified request is redirected to the standard proxy 
means for translating the destination address to the cache transparently from the standpoints of both the 

address of the proxy cache in the IP header each packet client and the prc>xy cache, the proxy cache being able 

received from the client following receipt of the request to determine whether it can satisfy the request for the 

message. 5 specified object by using the combined destination 

29. The proxy redirector of claim 20 further comprising: address of the origin server and the complete address 
means for receiving a packet from the proxy cache contained in the modified request message. 

indicating a maximum segment size that packets sent to 39. The computer readable memory of claim 38 wherein 

it thereafter should have* the step of modifying comprises the step of prefixing the 
means for reducing by a prietermined number the maxi- lo complete address in the request message with the destination 

mum segment size of packets to be thereafter sent to the address of the ongm server. 

proxy cache* and computer readable memory of claun 39 wherein 

means for sending a packet to the client indicating the ^^^^ computer program instructions further comprise 

reduced maximum segment size that packets thereafter instructions definmg the step of: 

sent by the client to the origin server should have. . . translatmg the desunauon address m an IP header of the 

30. The proxy redirector of claim 29 wherein the predc- least one packet containing the modified request 
termined number is equal at least to the maximum number message to the address of the proxy cache. 

of bytes added by the prefix of the destination address to the computer readable memory of claun 40 wherein 

complete address in the request message. said computer program instructions further comprise 

31. The proxy redirector of claim 30 further comprising: mstrucUons definmg the step of: 

means for successively shifting to a next packet in the ^° translating a source address in the IP header of the at least 

request message overflow bytes caused by the prefix of packet oontammg the modified request message 

the destination address to the complete address in the from^ address of the chent to an address of the Diyer 

request message, if the request message is contained ^^^^^^^^ ^ ^^^^ ^ ^^^^^ 

withm more than one packet; arid ^^.^ computer program instructions further comprise 

means for changing a length-of-packet parameter m the IP instructions defining the step of: 

header in a last packet of a multi-packet request mes- ^ ^P header in packets received from the proxy cache 

sage to refiect the bytes shifted into the last packet from commencing with an acknowledgment to the modified 

a previous packet, request message, modifying a value in a acknowledged 

32. The proxy redirector of claim 30 wherein the proxy byte sequence number field to refiect the increased 
cache further comprises means for changing a length-of- number of bytes in the modified request message 
packet parameter in the IP header to reflect the change in the resulting from the step of prefixing the complete 
length of that packet caused by the prefix of the destination address wi]h the destination address of the origin 
address to the complete address in the request message, if server. 

the length of the request message is contained within one 43. The computer readable memory of claim 42 wherein 

packet, the step of modifying the value in the acknowledged byte 

33. The proxy redirector of claim 19 further comprising sequence number field comprises the step of decreasing the 
means for selecting the proxy cache from a plurality of value in the acknowledged byte sequence number field by 
different proxy caches. the number of bytes added by prefixing the destination 

34. The proxy redirector of claim 19 wherein the means ^ address of the origin server to the complete address in the 
for modifying the request message is a gateway program request message. 

dynamically loaded on a programmable network element. 44 Jhe computer readable memory of claim 43 wherein 

35. The proxy redirector of claim 19 wherein the request said computer program instructions further comprise 
message is a GET request. instructions defining the step of: 

36. The proxy redirector of claim 19 wherein the request translating in the IP header the destination address to the 
message is a POST request. address of the cHent and the source address to that of 

37. The proxy redirector of claim 19 wherein the request the origin server in the packets received from the proxy 
message is a HEAD request. ^^^he commencing with the acknowledgment to the 

38. A computer readable medium storing computer pro- request message. 

gram instructions which are executable on a computer 45. The computer readable memory of claim 40 wherein 

system implementing a Layer 4 switch for redirecting an 5^^^ computer program instructions further comprise 

HTTP connection request from a client to a proxy cache over instructions defining the step of: 

a packet-based computer network, said computer program modifying a value in a sent byte sequence number field to 

mstnicuons compnsmg instrucUons definmg the steps of: ^^^^j increased number of bytes in the modified 

receiving at least one packet containing the request 55 request message resulting from prefixing the complete 

message, the at least one packet having an IP header address vnth the destination addrcssof the origin server 

comprising a destination address of an origin server, the ^ ^ j^P header in packets received from the cHent 

request message including a complete address of a foUowing receipt of the request message, 

specified object at the origin server; 45 jhe computer readable memory of claim 45 wherein 

modifying, at die packet level, the request message by 50 the step of modifying the vahie in the sent byte sequence 

combining the destination address of the origin server number field comprises the step of increasing the value in 

with the complete address; and the sent byte sequence number field by the number of bytes 

forwarding the at least one packet containing the modified added by prefixing the destination address of the origin 

request message to the proxy cache over the packet- server to the complete address in the fequest message, 
based network; 6S 47. The computer readable memory of claim 46 wherein 

wherein the proxy cache is a standard proxy cache and, in said computer program instructions further comprise 

the forwarding step, the at least one packet containing instructions defining the step of: 
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translating the destination address to the address of the 
proxy cache in the IP header each packet received from 
the client following receipt of the request message. 
4S. The computer readable memory of claim 39 wherein 
said computer program instructions further comprises 5 
instructions defining, prior to the step of receiving the at 
least one packet containing the request message, the steps of: 
receiving a packet from the proxy cache indicating a 
maximum segment size that packets sent to it thereafter 
should have; 10 
reducing by a predetermined number the maximum seg- 
ment size of packets to be thereafter sent to the proxy 
cache; and 

sending a packet to the client indicating the reduced 
maximum segment size that packets thereafter sent by 
the client to the origin server should have. 

49. The computer readable memory of claim 48 wherein 
the predetermined number is equal at least to the maximtun 
number of bytes added by the step of prefixing the destina- 
lion address to the complete address in the request message. 

50. The computer readable memory of claim 49 wherein 
said computer program instnictions further comprise 
instructions defining the steps of: 

successively shifting to a next packet in the request 2S 
message overflow bytes caused by the step of prefixing 
the destination address to the complete addr^ in the 
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request message, if the request message is contained 
within more than one packet; and 
changing a leogth-of-packet parameter in the IP header in 
a last packet of a multi-packet request message to 
reflect the bytes shifted into the last packet from a 
previous packet. 

51. The computer readable memory of claim 49 wherein 
said computer program instructions further comprise 
instructions defining the step of changiDg a length-of-packet 
parameter in the IP header to reflect the change in the length 
of that packet caused by the step of prefixing the destination 
address to the complete address in the request message, if 
the request message is contained within one packet. 

52. The computer readable memory of claim of claim 38 
wherein said computer program instructions further com- 
prise instructions defining the step of selecting the proxy 
cache from a plurality of different proxy caches prior to 
receiving the at least one packet containing the request 
message. 

53. The computer readable memory of claim 38 wherein 
the request message is a GET request. 

54. The computer readable memory of claim 38 wherein 
the request message is a POST request. 

55. The computer readable memory of claim 38 wherein 
the request message is a HEAD request. 

♦ * » ♦ * 
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[57] ABSTRACT 

Methods and apparatus for accelerating a route r in a com - 
mun ifftioDS_network are, described ^Jg one ena bodimenU a 
r outer accelerator includes arforwarding table^for ass ocia- 
tively storing a d estination addres s and a ne xt bop addre ss. 
If a destination address of a packet matches a destination 
address in the f orwarding tattl e, then logic forwards the 
packet to a next hop. A i putej may be'^sjupiea lo at least one 
network port_tbrp.ugh_lhe router accelerator. In another 
approach, the network includes at least one host a nd at least 
onejQuler. The host has at least one r outing table for 
assoeiatively storing a s econd-level destination address , a 
se cond-level next hop address and a first-lev el next hop 
address. A router accelerator includes red irect logic for 
stonng_t he second-leve l desti nation addr ess in a second- 
level next hop address entry in the at least one host routing 
table. This causes the host to request a first- level next hop 
address corresponding to the second -level destination 
address. An accelerator table responds to the host's request 
with the bound first-level next hop address. In another 
approach employing host routing tables, a router accelerator 
includes request logic for requesting from the router a 
second-level next hop address in response to the first-level 
next hop address. ttp-HiVsct lo gic stores the second-level nex t 
h op address in a second-level next hop address entry in the 
at least one host routing table . This causes the host to request 
a first-level next hop address corresponding to the s econd- 
levdneYt ho p address . 
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METHOD AND APPARATUS FOR routers maintain IP routing tables to detennine where to send 

ACCELERATING OSI LAYER 3 ROUTERS a datagram based upon the IP destination address. TVpically 

a routing table contains pairs (N, R), where N is the IP 

BACKGROUND address of a destination network or host, and R is the IP 

1 c- 1^ ♦u^ T«„^«t;«„ 5 address of the next router along the path to network or host 

1. Field 01 the Invention vi t .l- * J * i i * *i_ 

^ , ^ . N. In this manner, a router need not know the complete path 

The present invention relates to the field of routmg ^o a destination node, only the next hop. The next hop in a 

messages in a communications network, and m particular to machine's routing table must lie in a physical network to 

mcreasmg the throughput of a routing system. ^^^^^ machine connects directly. To assure selection of 

2, Description of the Related Art lo the proper next hop, routers exchange routing tables to keep 
In recent years, there has been an exponential increase in track of changes in network configuration. 

the demand for bandwidth in communications networks. After executing a routing algorithm to obtain the next hop 

This increase is due to a variety of factors, including an ip addi^ in response to the ultimate destination address of 

increase in the number of users as the Internet moves a datagram, the router passes the datagram and the next hop 
towards becoming a mass communications medium, and an is address to network interface software in the router respon- 

increase in bandwidth-intensive multimedia applications sible for the physical network over which the datagram must 

that integrate still images, video and speech with data. To be sent. The network interface software binds the next hop 

keep up with this increasing demand, communications address to a physical address, fonns a frame using that 

facilities must frequently be upgraded. physical address, places the datagram in the data portion of 

Data routers are critical components in data communica- the frame, and sends the resulting frame to a second (next 

tions networks. Routers link physical networks along the hop) router over the physical network linking the next hop 

path from a source node to a destination node. In a network router to the first router. This process continues until the 

employing packet switching, data packets are passed from datagram reaches a final router that is connected directly to 

router to router until they reach their final destination. This the same physical network as the destination. At that point, 
process is known as "next hop routing." ^ the final router wlQ dehver the datagram using direct deliv- 

FIG. 1 illustrates a conventional communications network ^^y* 

of the type that employs next hop routing. In this example. After using the next hop address to find a physical 

a router 100 couples three networks 102. Each network 102 address, the network interface software discards the next hop 

includes at least one host computer 104. Software in the address. If the host is sending a sequence of datagrams to the 

router 100 implements an Internet protocol (IP) routing same destination address, this process of determining the 

algorithm that determines how to send an IP datagram across next hop IP address and then the next hop physical address 

thenetworks. The Internet protocol operates at layer 3 of the is repeated, even though it appears very inefficient. As is 

well-known layered OSI standard. For a host 104 to transfer weU known in the art, the binding between the next hop IP 

an IP datagram to another host 104, the sender encapsulates address and the physical address is not saved because of the 

the datagram in a physical frame ("packet"), wherein the philosophy underlying the Internet protocol. The protocol 

destination IP address is ma pped to a ph^ cal (layer 2) builds an abstraction that hides the details of one network 

address. If the source and destination hosts lie within the layer from another, thereby maintaining the distinction 

same pnysical network, such as an Ethernet, then the source between the IP and physical address layers, 
can send the datagram directly to the destination over the ^ Because router functions, such as exchanging routing 

physical network. tables and executing routing algorithms require the flexibil- 

FIG. 2 illustrates the format of an IP datagram 200 ity of software to react to a changing environment, these 

encapsulated in a physical frame 202. The frame 202 functions are implemented in software rather than hardware, 

includes a frame header 204 followed by frame data 206 and Thus, the capacity of routers is limited to the capacity of the 

a cyclic redundancy check code 208. The IP datagram 200, microprocessors and the software that implement the nec- 

which is encapsulated as frame data 206, includes a data- essary algorithms. 

gram header 210 and datagram data 212. order to increase data throughput, some routers have 
FIG. 3 illustrates the format of a physical frame header been designed in hardware to overcome the limitations of 
204, such as an Ethernet frame header. The frame header 204 software. For example, Rekhter describes systems that con- 
includes a preamble 300, a destination address 302, a source tain local forwarding tables in switches, known as "forward- 
address 304, and a frame type field 306. The preamble is ing information bases." Each table entry includes an ulti- 
employed for synchronization purposes. The destination mate destination address and a next hop destination address, 
address 302 contains the physical (layer 2) address of the The forwarding information base is constructed from infor- 
destination node, which is unique to each hardware unit and mation contained in a "routing information base" that in turn 
is hard-coded into the hardware at the time of manufacture. 55 is constructed from routing infonnation received from the 
The source address 304 is the physical address of the source network. For further information please refer lo Y. Rekhter, 
node. "Inter-domain routing: EGP, BGP, and IDRP," in M. 

FIG. 4 illustrates the format of an IP datagram header 210, Steenstrup, Routing in Communications Networks, First 

showingonly fields that are re lev ant to the present invention. Edition, 1995, pp. 99-133. Forwarding tables are also 
The fields include a time to live field 400, an IP source 50 employed in U.S. Pat. No. 5,566,170 issued to Bakke et al. 

address 402, and an IP destination address 404, For further These and all other references referred to herein are incor- 

information regarding frame formatting and networking in porated by reference herein. 

general, please refer to D. Comer, Internetworking With Further, other hardware- accelerated routers cache next 

TCP/IP" Volume 1, Prentice-Hall. Third Edition, 1995, hop informationTAilhough these hardware routers increase 
which is incorporated by reference herein. 55 throughput, replacing existing routers with these advanced 

If the source and destination lie in different networks, then routers is an expensive proposition. The expense is not just 

the router is employed to effect the transfer. Both hosts and the cost of the router itself, but the administrative costs 
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incurred in configuring the new router and the cost of address and first-level next hop address. The accelerator 

associated network downtime. Accordingly, it is desired to table re^onds to the host!s-cequest-wttb4h fii?ouDd firs t-level 

find a more cost-effeaive way of upgrading existing routing next hop-address, 
systems to increase data capacity. 



SUMMARY OF THE INVENTION 



BRIEF DESCRIPTION OF THE DRAWINGS 



Fl G. 1 illustrates a conventional communications network 

The present invention provides methods and apparatus for jyp^ j^at employs next hop routing, 

accelerating a router in a communications network. In one _ - •« , , r , r td j ♦ 

. ° , 1 * • 1 J r J- FIG. 2 illustrates the format of an IP data gram m 

embodunent, a router accelerator inclu des a lorward mg t . ^ • i. ■ i r 
tab le for associativelv storing a destination addn: ss and a lo capsuiated m a physical frame, 

n ext hop addr ess. If a destination address of a packet FIG- ^ illustrates the format of a physical frame header, 

m atches a destination address in the fonvarding tah kuJhfia FIG. 4 illustrates the format of a IP datagram header, 

lo ^ forwards the packet to a next hop . The destination FIG. 5 is a communications network incorporating a 

address may be a network address, and the next hop address router accelerator switch of the present invention, 

ma^ t?e a physicaLaddfpss. i5 ^ illustrates a router accelerator switdi according to 

A router may be coupled to at least one network port the first embodiment to the present invention, 
through the router accelerator. Ihe router binds the next hop p,Q y iu^strates the data structure for a packet forward- 
address to the destmation address of the packet if the packet ^ ^^^^^ ^^^^ invention, 
destination address does not match a desti nation add ress in „ .„ .... . r . t.- . 
the forwarding table. THe" router does not process a pa cket if ^0 FIG 8 illustrates the data structure of a router history 
the packet destin ";tion matchesJa the forwarding table. ^^^^ P^^°^ invention. 

A router history table associatively stores at least one 9 is a flow chart mustrating the operation of the 

destination address and a. corresponding address of the '^^^^^ accelerator switch of the present mvention. 

network port that received a packet having that destination FIG. lOA illustrates a first host routing table employed by 

address. Each forwarding table may be associated with a the present invention. 

network port. The accelerator stores the next hop address in FIG. lOB illustrates a second routing table employed by 

the forwarding table for a particular port if that port is the present invention. 

associated in the history table with a destination address that pjG n a flow chart illustrating the operation of a 

is bound to the next hop address. second embodiment of the present invention. 

In another embodiment, the network includes at least one pjQ ^ illustrates an accelerator table employed by the 

host and at least one router. The host has at least one routing host-syntric embodiments of the present invention, 

table for associatively storing a sg cond-level d^ Unatioii ^3 ^ iflustraling the operation of a third 

address^ s^ond-level next hop addre ^nd a firsNlevel ^^^^^^^ ^ j^^^^^ion. 

neXTE^i-aadFess. The router bmds a first-level next hop , t r , • • 

address to a second-level destination address. A router FIG. 14 dlustrates a RARP table of the present invention, 
accelerator includes recfirect logic for storing the secon d- 
level destination address in a second-level next hop addr ess 
p.nty-tn^flj^ ^^ [|ef^«ft jMieJiQSt routing table. This causes the 
host lo request a first-level next hop address corresponding ^ The present invention provides meth ods and apparatus fo r 

to the second-level dest ination addre ss. An accelerator table accel erating OSI la yer JL-routing^In the following 

associatively stores tEc^bound sec ond-level desti nation desCT^tion, numerous details are set forth in orderlo enable 

address and fi rst-level tieit~hop addres s. The accelerator a thorough understanding of the present invention. However, 

table responds to the host's request with the bound first-level it will be understood bv_tho se_of ordinarv_skill to the art that 

next hop address. th ese spedficdetails are not required in o^^ to practice the 

The first-level addresses may be physicaXaddresses, and invention. Further, well-known elements, devicesrprocess 

the second-level-addressesmay be network addresses. The steps and the like are not set forth in detail in order to avoid 

router may be coupled to at l easTonenetwo rk port through obscuring the present invention. 

the router accel erator. The present invention provides a means whereby existing 

The host associatively stores the bound second-level 50 routing systems may be upg raded to.increasetto at 

destination address and first-level next hop address in a host low cost H G._ 5j lllustia to_a _commun^ network 

forwarding table. T he hos t forwards a packet to a next h op incorporating ^ router accelerat or switch S^^J^f the present 

if a secondrleyeLdestina tion Addreg_of the pa c to invention. The neiworics ou^, me swiicii oWTand the router 

a secQndrle>¥Ldesti natiQxuadciiass^ ^ 504 may form j^vjde-ffea ^nctwQrk.thaLcoM^ 
table. A router coupled to die accelerator dcwis^noTproccss 55 of g'eogra . phicallv^-d ispersed networks together, 

the packet if the second-level destination address of the Alternatively, the networks 502 may be local-area networks 

packet matches a second-level destination address in the at a given sit e, perhaps in different buildings or^of different 

host forwarding table. types, sucn as a fiber-optic token ring network or an Ether- 

In another embodiment employing host routing tables, a net. In the description that follows, it wiU be assumed for 
router accelerator includes request logic for requesting from 60 simplicity tha> the networks are Ethernet local area net- 

the router a second-level next hop address in response to tiie works. Further, although three networks 502 are illustrated, 

firet-level next hop address, RedirecUo gic storesJ hfLSCcond- many more networks would be connected in most actual 

leyeLnaxi-iiop address in a second-level next hop addres s installations. 

entr y in the at least one host routing tab le. This causSlhe The router accelerat or switch 500 of the present invention 
host to request a first-level next hop address corresponding 65 is independent Qf-the-desi^n nt the rn^^^ "^^e mutiny? 

to the second-level next hop address. An accelerator ta bles algorithms and the network topology. The accelerator 500 

associativelyjtores.th e_oon :csp^ next ho p mav_bejnstalledJn-ai :onventional routing svstem bv simpl y 
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unplugging the router 504 from the network connections, ciativejy ^ores the IP address of the ultimate destination of 

connecting the accelerator to those connections, and plug- a padceUjlheJayCTjjph ysical) addr oflhe^oe^jhop, an 

ging the router 504 into the accelerator i pfitch 500 . Through indicator'ortHrnetwork-sid 

the operation described below, the acc elerator switch 500 is (cJesfinalwnpoR field indicating 

trans parent to both the networks 502 and the router 504, 5 howlo^g the address entry isJcusm ai p in the t ab le. Becau se 

e xcept thatjhe rou ter 504 will appear to have a much larger the switchjnusLbeJranspare nt to the router andlhe ne twork, 

capacity foir ^vvilcyi mg'p'ackets. th c^destination .por t is the same port at which the pad^w as 

The distinction between a router and a switch is somewhat re ceived from the router side., 

arbitrary, but generally lies in the level of sophistication of FIG. 8 ilhistrates the data structure of the router history 

the mechanisms used to determine the next hop destination, 10 t able 610. The router history tal5Ic~610 ai;aociativMy s tores 

and the extent to which the system communicates with other th e IP destination address along with ttie pQrt_at wnich the 

parts of the network to exchange information from routing ro uter received a packet having the associ ated IP destination 

tables. For purposes of this application, a router performs the addcess. 

function of calculating a next hop destination by coj nmu- Referring to the flow chart of FlGr9, the rputeiialQ^lera- 

nicating with other parts of the network to monitor^the is ^j. g^sritch of thc^prc^njLjnygntiQg jDpc^ as follows. The 

n ejwork _co_nfiguration . Conversely," Tor purposes of the switcb^y5ts2Qr-.aj?.ack^t .,t(^anive^eiA 

application, a conventional s witch is assumed not to perfon n port or a router-side port (step 900). When-a«packet is 

any calciilfltinns tn det ermine the route of a packet , but rec6isp.d.QxeijQLnQj3fJ^^^ being 

simplyJorwapl s the packet according to next hop inform a- di rected to the router as a next hop , it is jnteroept ed by the 

t ion proy ided-hjjth e-coiujtej:^ ^ fp uter accelerator s\^ a^cl i,aUhej^ 

According to the present invenfion^ the switch of the p ort input. The^PraL 60y1hep^^ 

invention is connected between the network ports and the add ress from the packet fram e (step 904) and forwards it to 

routing system i n order to intercept packets transmitted to the partes padcfrt-fnTwatding table for comparison with the 

a nd from the router::S &!s adtch acts as a "shadow router" t o I P destination addresses s tored in the CAM of the PFTfeteps 

observe and mimic some of_the router's functions. Bv doing 906. 9081 The packet pro ccssingJogjc 604 also instruc ts the 

so, the switc h need not receive instructions from the ro uter m emory controller to stotejhe packet in the shared memory 

in order to rmite every j acket.^ paclKL5|Dage. 

In one embodiment, the switch recor 4s^he_a ext^op If the IP des tinatioa address does not match an address in 

ph^gkal addre ss computed _b v. the^jiPuter for a packe t. The t he packet forwarding table, then the packet processingl ogic 

switcG" then u supK. the next hopi routing faLrK:tio_p of t he in structs the memory controller 0V6r_th6_cpn trjQLbus to 

r outer for suteequept packets directed ^ to4 h e same dcsti oa- transmit the packe£from _shar^_mcmor y to jhe^corrc spond- 

tigiL_ ing router^^irtjp^x lt^^^ so_ t hat the 

FIG. 6 i llustrates a router accelerator switch gg)accord - packet may be processed by the router ^e^910V 

ing ja-aJaist_embodimenl-QLth£LipiSSeat_inyj?.n^^ The 35 Upon receiving the packet, the router performs its usual 

switchJncludesrthree=,ports,4([^ connected to three c orre- function of computing the next hop layer 2 (physical) 

spondin g networks, (not shown), and three ports 602 con- address. After doing so, the router transmits the packet back 

nected to concsponding ports of a router (not shown). The to the switch oyer the appropriate router-side port leading to 

ports correspond one-to-one wit h each other to maintain the the next hop destination (steps 900, 902). The packet 

transparency of the switch. E ach port in cludes packet pro- 4Q returned to the switch_b v the router is encapsula ted in a 

c essing logic (PPL i-£6fi 4 on network sgerWg^i^r^ Iter physi cal frame havin g the next h op layer 2 address in the 

,.sMS)-fe)r handling layM^and layer 3 communicatioiK with, frame header. The router-side packet processing logic 606 of 

for example, an.Ethemet network. The PPL circuitry may be the switch at the port receiving the returning packet instructs 

irap_lemejite d_using standard Ethernet interf ace chips. the memory controller to store this packet in shared memory. 

— Each portj6ia.on the network side is coupled to a first 45 Further, under instructions from the router-side packet pro- 

c oDtent addressable memory 608 (CAM^ that implenStT a cessing logic 606, the memory controUcr also causes the IP 

p acket forwarding table rPFT^ of the^ invention. On the destination address and iiq)ut port identifier to be stored in 

router side, the switch includes a table 611LJinpkm£lUed ^^ter history table (step 912). 

with a CAM , which acts as a router history table of the The router-side PPL 606 then extracts the IP destination 

invention. Shared memory packet storage 612 acts as a 50 address from the packet header (step 914). The PPL then 

queue for storing packets, with one cueue per router/ causes the destination address to be compared to the 

ne twork po rt. The pac ket storage is shared by both the router addresses in the router history table (step 916). If there is a 

po rts and the netwo rk.pHtts. A micropitxessorCSl^xecutes hit, then this indicates that the packet was previously sent to 

pro grams stor edln" flash memory 616 and uses DRAM the router by the router accelerator switch, 

memory 618 for temporary storage. A memory controller 55 In response to a hit, the network-side packet processing 

620 controls transfer of the packets among the router ports logic instructs the memory controller to extract the IP 

602, the network ports 600 and the shared memor^^US. The destination address and the layer 2 next hop address from the 

memory controller 620 keeps track of the locatioa.Q£jiackets packet stored in shared memory. The memory controller 

i n memo ry. It is also responsible for no tifyin g the m icro- then stores this data in the packet forwarding table (or tables) 

proc essor of an overflow . Control signals are transferred go for the ports indicated by the router history table as corre- 

throughout the swit ch over a control J jus 622.-,Data^is sponding to the associated IP destination address (or 

transfenceS.betwee n the shared m emory 612.and,the network addresses) (step 918), After the next hop information is 

p orts _6 0Q.oycr,ajfiist.da ta bus 62 4. whereas data is trans- stored in the packet forwarding table, the router history table 

ferred between the router ports 602 and shared memory 612 deletes the entry for the corresponding destination address 

over a second data bus 626. 55 (step 920). 

FTG. 7 illustrates the data structure for the packet for- The router-side packet processing logic 606 forwards the 

warding table 608. The packet forwarding table j08 asso- ^ returned packet to the next hop destinatioa through the 
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network port corresponding to the router port serving as essentially the same as that of FIG. 6, except the accelerator 

output, which is not necessarily the network port from which does not include the packet forwarding tables, and the 

th^^cket was received (step 922), router-side table in both bost-centric embodiments performs 

/y^ihis point, the packet forwarding table now stores the a different function than the router history table, as explained 

i^ddtess binding for a first packet in a stream of packets 5 below. 

' directed towards a particular destination, where the packet Generally, packets originate at a host. The host determines 

has been processed by the router. If the next received packet whether the destination IP address specified by the packet is 

is fi-om the network, then the network-side packet process- found in its own host tables (step 1102). If so, then the host 

ing logic 604 for the input port causes the memory controller uses its ARP cache to send the packet directly to the next hop 

) 620 to store the packet in shared memory 612 and to extract lO physical address (step 1104). 

the destination address from the IP datagram header (steps if qq match, the host forwards the packet to a default 

900. 902, 904). The packet processing logic then causes the jo^te^ through the intervening router acceleration switch of 

IP destination address to bejQput to the associated packet the invention (step 1106). At this point, the switch just passes 

forwarding table (step 9061 If the table indicates a hit, then packet to the router, 

the table transfeis the associated physical (layer 2) next hop 15 ^ ^^^^^^ router binds the next hop physical address 

address to the packet processing logic (steps 908, 910). The ^ ,1,^ destination IP address (step U08). The router then 

packet processing logic causes the memory controUer to ^ encapsulated in a physical frame with 

forward the IP datagram to the packet processing lo^c 604, ^^ ^^^^^ ^^ muter-side port 

which encapsulate the datagram m a frame includmg the ^^^^ ^^ ^^^^ ^ destination. The packet processing 

ayer 2 next hop address (step 912). Tbt packet processing 20 ^ ^ ^^^^ ^^^^ 

logic 604 then forwards the frame to the next hop desUnation ^ router-side data bus 626. The router- 

on the appropriate output port (step 914). p^^^^j processing logic 606 also instructs the memory 

Id some instances, a packet is originated at the router controller 620 to ou^ut the frame from the shared memory 

(steps 902, 904). Such packets typicaUy convey status 512 onto the network-side data bus 624. and instructs the 

information, such as data to update routing tables. The router " packet processing logic 604 of the network port conrespond- 

encapsulates these packets in a physical frame specifying the jng to the router port to forward it to the next hop (step 

next hop destination. Router-generated packets are not 1112) 

stored in the router history table. The router-side packet 3^^^^ ^^j^^ Uinsmils the packet to the next hop. the 

processing logic that received the router-generated packet microprocessor 614 of the accelerator switch stores the 

stores 4efi«me m a shared memory queue. The memoir is destination IP/next hop physical address binding in the 

logicaUy subdivided mto a queue for each output port. When ^uter-side table 610 (the "ARP table" in this embodiment) 

It IS found by the router-sude PPL that the pactet ongmated ^ .^^^ ^ ^ 

from the router, then the PPL forwards the packet to shared jj, where the destination IP and next hop physical 

memory. When the packet reaches the top of the queue >t is .jdresses are associatively stored, 

transmitted from the correspondmg output port (step 924). _ •. u .u . 1 . . ... • .u v . * j- 

„ , , , „ The switeh then takes steps to tncK the host forwarding 

Packets addressed directly to the router s P address from ^^les into storing this binding. Tlie switch microprocessor 

die network ate sent by the receiving port s network-side j j^^^^^, ^ p^MP) 

PPL to the router, but not entered m the router history tabic ^ ^ f^^^^^ ,cmp message is encapsulated 

because it is not expected that the router will return them. ^ ^^^ip datagram, which is itself encapsulated in a frame for 

The first embodiment described above relies entirely on transmission. One type of ICMP message is a redirect 

tables stored in the switch. The foUowing second and third message. In conventional systems, routers employ redirect 

embodiments may be deemed "bost-centric" because they messages to update host routing tables. A host boots up 

store relevant routing information in pre-existing routing knowing the address of only one router on the local network, 

tables in the host. Yhis default router returns an ICMP redirect message when- 

The host-centric router accelerator relies upon host rout- ever a host sends a datagram for which there is an initial 

ing tables that can perform the address resolution protocol router along a more optimal path. The redirect message 

(ARP) function. To perform ARP, a host sends out a broad- replaces the default router in the table with the address of 

cast message to all nodes within its broadcast domain, i.e., this more optimal router. 

local area network. The ARP message contains an IP desti- 50 According to the present invention, the microprocessor in 

nation address and a request for the node having that address t^e switch sends a redirect message to the host to cause the 

to transmit back its physical address. In this manner, the host host table to associate the destination IP address of the 

binds an IP address to a physical address. packet with itself so as to act as the next hop IP address in 

FIGS. lOA and lOB illustrate relevant entries in two the table (step 1114). Based upon this table entry, the host 

routing tables found in a conventional host. The first table 55 sends an ARP message to obtain the corresponding physical 

associatively stores a destination IP (second-level) address address (step 1116). In a conventional system, the ARP 

along with a corresponding next hop IP (second-level) message would be used to obtain a physical address corre- 

address. The second table, known as the host ARP cache or spending to an IP address in the table. Here, because the 

forwarding table, associatively stores a next hop IP (second- destination IP address has been substituted for the next hop 

level) address along with the corresponding next hop physi- eo IP address, a conventional system would attempt to return a 

cal (first-level) address. These two tables together effectively physical destination address. However, in a conventional 

provide a mapping of destination IP address to next hop system this would only work if the destination node were 

physical address. The reason two tables are employed is that within the same local network (broadcast domain) as the 

the address boundary between the layers is maintained by host. In contrast, the present invention employs the ARP 

using this format. 6S table of the switch to intercept and respond to the host's ARP 

The operation of one host-centric embodiment is iUus- request. Recall that the ARP table has already stored the 

trated in FIG. 11. The structure of the router accelerator is address binding computed by the router. In this manner, the 
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switch acts as an ARP server to return the next hop physical 
address in response to the IP destination address sent by the 
host (step 1118). The host then adds this next hop physical 
address to its ARP cadie, the second host table (step 1120). 

The switch then waits for the next packet (step 1100). The 
microprocessor 614 in the switch determines whether the 
destination address specified by this packet matches a des- 
tination address in the ARP cache (step 1102). If so, the host 
responds with the next hop physical address. The 
microprocessor, coupled to the host network, then forwards 
the packet to the next hop destination at layer 2 (step 1104). 
Subsequent packets directed to the same destination IP 
address are handled in a similar manner. Through this 
technique, the host takes much of the burden ofif of the 
default router for sending packets to the first hop. 

Now that the host knows the next hop IP address and the 
physical address of the next hop, it will send subsequent 
packets having that destination IP address direaly to the 
physical address of the next hop. This process can be 
handled by the conventional layer 2 switching of the switch, 
which is found in the router-side packet processing logic 
606, microprocessor 614 and memory controller 620. 

Unlike the previous embodiment, this approach uses 
existing memory in the host to accomplidi functions that 
would otherwise require memory in the switch. The switch 
need not store information for every host. Instead, the 
information is distributed among the hosts. Moreover, the 
switch only needs the address information one time for the 
ARP table, instead of maintaining the information for every 
packet it sends (as in the packet forwarding table of the 
previous embodiment). Finally, the switch does not need an 
high-speed memory like a CAM. Instead, it employs a 
software table for its ARP response function. 

The operation of a second host-cenuic embodiment is 
illustrated in FIG. 13. As can be seen from a comparison 
with FIG, 11, many of the initial steps are the same. As 
before, the switch stores the destination IP addressAiext hop 
physical address binding in the ARP table in DRAM (step 
1310). However, in this embodiment, the switch also 
employs the reverse address resolution protocol (RARP). 

The microprocessor 614 in the switch issues to the router 
a RARP request containing the next hop physical address in 
order to obtain the next hop IP address from the router (step 
1314). According to RARP, a machine on the network sends 
out a broadcast message containing a physical address to 
nodes on its local network in order to obtain a corresponding 
IP address. Conventional routers can act as RARP servers. 
The microprocessor in the switch stores the RARP next hop 
physical address/next hop IP address binding in the table 
illustrated in FIG. 14, which is also stored in DRAM (step 
1316). 

Unlike the previous embodiment, the switch sends the 
real next hop IP address to the first host table (step 1318). 
The host issues an ARP request with this next hop IP address 
in order to obtain the next hop physical address (step 1320). 
The switch acts as an ARP server to return the next hop 
physical address (step 1322). The host adds the next hop 
physical address to its ARP cache (step 1324). Subsequent 
packets directed to the same destination IP address then 
match in the host tables. Again, by filling the host tables with 
the next hop physical address information, the host can 
facilitate layer 2 switching, therefore bypassing the router. 

The RARP approach does not require the counterfeit 
redirect message of the previous embodiment. The redirect 
message of the previous embodiment may be rejected by 
Internet security systems diat are sensitive to such ARP 
spoofing. 
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Based on the foregoing, it can be seen that the router 
accelerator of the present invention need not receive and 
maintain information about the status of the network or 
execute routing algorithms. Thus, packet forwarding for the 
vast majority of packets in a source-destination stream can 
be performed by the high-speed router accelerator, which is 
implemented entirely in hardware. Further, performance of 
the router accelerator is not hampered by the need to 
maintain and update software for implementing routing 
algorithms as the algorithms change. 

Although the invention has been described in conjunction 
with particular embodiments, it will be appreciated that 
various modifications and alterations may be made by those 
skilled in the art without departing from the spirit and scope 
of the invention. The invention is not to be limited by the 
foregoing illustrative details, but rather is to be defined by 
the amended claims. 

What is claimed is: 

1. A router accelerator switch comprising: 

a forwarding table for associatively storing a destination 
address and a next hop address; 

logic for forwarding a packet to a next hop if a destination 
address of the packet matches a destination address in 
the forwarding table, wherein an associated router does 
not process the packet if the packet destination address 
matches a destination address in the forwarding table, 
and logic for communicating the packet to the router if 
the packet destination address does not match a desti- 
nation address in the forwarding table, wherein the 
router computes the next hop address corresponding to 
the destination address of the packet if the packet 
destination address does not match a destination 
address in the forwarding table. 

2. The router accelerator switch of claim 1, wherein the 
destination address is a network address and the next hop 
address is a physical address. 

3. The router accelerator switch of claim 1, wherein the 
router is coupled to at least one network port through the 
router accelerator switch. 

4. Hie router accelerator switch of claim 3, wherein the 
router binds the next hop address to the destination address 
of the packet if the packet destination address does not 
match a destination address in the forwarding table. 

5. The router accelerator switch of claim 1, further com- 
prising a router history table for associatively storing at least 
one destination address and a corresponding address of a 
network port that received a packet having that destination 
address. 

6. The router accelerator switch of claim 5, wherein each 
network port has an associated forwarding table, and the 
accelerator stores the next hop address in the forwarding 
table for a port if the port is associated in the history table 
with a destination address that is bound to the next hop 
address. 

7. In a network having at least one host and at least one 
router, the host having at least one routing table for asso- 
ciatively storing a second-level destination address, a 
second-level next hop address and a first-level next hop 
address, the router for binding a first-level next hop address 
to a second-level destination address, a router accelerator 
switch comprising: 

redirect logic for storing the second-level destination 
address in a second-level next hop address entry in the 
at least one host routing table to thereby cause the host 
to request a first-level next hop address corresponding 
to the second-level destination address; 

an accelerator table for associatively storing the bound 
second-level destination address and first-level next 
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hop address, wherein the accelerator table responds to 
the host's request with the bound first-level next hop 
address. 

8. The router accelerator switch of claim 7, wherein 
first-level addresses are physical addresses and second-level 
addresses are network addresses. 

9. The router accelerator switch of claim 7, wherein the 
router is coupled to at least one network port through the 
router accelerator. 

10. The router accelerator switch of claim 7, wherein the 
host associatively stores the bound second-level destination 
address and first -level next hop address in a host forwarding 
table, 

11. The router accelerator switch of claim 10, wherein the 
host forwards a packet to a next hop if a second-level 
destination address of the packet matches a second-level 
destination address in the host forwarding table. 

12. The router accelerator switch of claim 10, wherein a 
router coupled to the accelerator does not process the packet 
if the second-level destination address of the packet matches 
a second-level destination address in the host forwarding 
table. 

13. In a network having at least one host and at least one 
router, the host having at least one routing table for asso- 
ciatively storing a second-level destination address, a 
second-level next hop address and a first-level next hop 
address, the router for binding a first-level next hop address 
to a seoond-level destination address, a router accelerator 
switch comprising: 

request logic for requesting from the router a second-level 
next bop address in response to the first-level next hop 
address; 

redirect logic for storing the second -level next hop 
address in a second-level next hop address entry in the 
at least one host routing table to thereby cause the host 
to request a first-level next hop address corresponding 
to the second-level next hop address; 

an accelerator table for associatively storing the corre- 
sponding second-level next hop address and first-level 
next hop address, wherein the accelerator table 
responds to the host's request with the bound first-level 
next hop address. 

14. The router accelerator switch of claim 13, wherein 
first-level addresses are physical addresses and second-level 
addresses are network addresses. 

15. The router accelerator switch of claim 13, wherein the 
router is coupled to at least one network port through the 
router accelerator 

16- The router accelerator switch of claim 13, wherein the 
host associatively stores the bound second-level destination 
address and first-level next hop address in a host forwarding 
table. 

17. The router accelerator switch of claim 16, wherein the 
host forwards a packet to a next hop if a second-level 
destination address of the packet matches a second-level 
destination address in the host forwarding table. 

18. The router accelerator switch of claim 13, wherein a 
router coupled to the accelerator does not process the packet 
if the second-level destination address of the packet matches 
a second-level destination address in the host forwarding 
table. 

19. A method for accelerating a router comprising the 
steps of: 

associatively storing a destination address and a next hop 
address in a forwarding table of a router accelerator 
switch; and 
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forwarding a packet to a next hop if a destination address 
of the packet matches a destination address in the 
forwarding table, wherein an associated router docs not 
process the packet if the packet destination address 
matches a destination address in the forwarding table, 
and communicating the packet to the router if the 
packet destination address does not match a destination 
address in the forwarding table, wherein the router 
computes the next hop address corresponding to the 
destination address of the packet if the packet destina- 
tion address does not match a destination address in the 
forwarding table. 

20. The method of claim 19, wherein the destination 
address is a network address and the next hop address is a 
physical address. 

21. The method of claim 19, wherein the router is coupled 
to at least one network port through the router accelerator. 

22. The method of claim 21, furdier comprising the step 
of the router binding the next hop address to the destination 
address of the packet if the packet destination address does 
not match a destination address in the forwarding table. 

23. The method of claim 19, further comprising the step 
of associatively storing in a router history table at least one 
destination address and a corresponding address of a net- 
work port that received a packet having that destination 
address. 

24. The method of claim 23, wherein each network port 
has an associated forwarding table, the method further 
comprising the step of storing the next hop address in the 
forwarding table for a port if the port is associated in the 
history table with a destination address that is bound to the 
next hop address. 

25. In a network having at least one host and at least one 
router, the host having at least one routing table for asso- 
ciatively storing a second-level destination address, a 
second-level next hop address and a first-level next hop 
address, the router for binding a first-level next hop address 
to a second-level destination address, a method for acceler- 
ating the router comprising the steps of: 

storing the second-level destination address in a second- 
level next hop address entry in the at least one host 
routing table to thereby cause the host to request a 
first-level next hop address corresponding to the 
second-level destination address; and 

associatively storing in an accelerator table the bound 
second-level destination address and first-level next 
hop address, wherein the accelerator table re^onds to 
the host's request with the bound first-level next hop 
address. 

26. The method of claim 25, wherein first-level addresses 
are physical addresses and second-level addresses are net- 
work addresses. 

27. The method of claim 25, wherein the router is coupled 
to at least one network port through the router accelerator. 

28. The method of claim 25, wherein the host associa- 
tively stores the bound second-level destination address and 
first- level next hop address in a host forwarding table. 

29. The method of claim 28, wherein the host forwards a 
packet to a next hop if a second-level destination address of 
the packet matches a second-level destination address in the 
host forwarding table. 

30. The method of claim 28, wherein a router coupled to 
the accelerator does not process the packet if the second- 
level destination address of the packet matches a second- 
level destination address in the host forwarding table. 

31. In a network having at least one host and at least one 
router, the host having at least one routing table for asso- 
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cialively storing a second-level destinalioa address, a 
second-level next hop address and a first-level next hop 
address, the router for binding a first-level next hop address 
to a second-level destination address, a method for acceler- 
ating the router comprising the steps of; 

requesting from the router a second-level next hop 
address in response to the first-level next hop address; 
storing the second-level next hop address in a second- 
level next hop address entry in the at least one host 
routing table to thereby caiise the host to request a 
first-level next hop address corresponding to the 
second-level next hop address; 
associatively storing in an accelerator table the corre- 
sponding second-level next hop address and first-level 
next hop address, wherein the accelerator table 
responds to the host's request with the bound first-level 
next hop address. 
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32. The method of claim 31, wherein first-level addresses 
are physical addresses and second-level addresses are net- 
work addresses. 

33. The method of claim 31, wherein the router is coupled 
to at least one network port through the router accelerator. 

34. The method of claim 31, wherein the host associa- 
tively stores the bound second-level destination address and 
first-level next hop address in a host forwarding table. 

35. The method of claim 34, wherein the host forwards a 
packet to a next hop if a second-level destination address of 
the packet matches a second-level destination address in the 
host forwarding table. 

36. The method of claim 31, wherein the router does not 
process a packet if the second-level destination address of . 
the packet matches a second-level destination address in the 
host forwarding table. 
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[57] ABSTRACT 

An improved partial packet filter (10) for filtering data 
packets (210) in a computer network (12) wherein a candi- 
date field (413) of the data pack^ (210) is hashed to a 
plurality of bit- wise subsets (636) each being an independent 
representation of the candidate field (413). Each of the 
bit-wise subsets (636) is compared to a reference hash table 
(644) which has been prepared in a prelinunary operation 
series (514). The prehminary operation series (512) config- 
ures a plurality of target fields C714) to set selected memory 
locations (312) in the reference hash table (644). 
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PACKET FILTERING FOR DATA 
NETWORKS 

TECHNICAL FIELD 

5 

The present invention relates generally to the field of 
computer science and more particulariy to data networking 
and component devices attached to data networks. 

BACKGROUND ART jq 

Computer networks are becoming increasingly common 
in industry, education and the public sector. The media over 
which data arc carried generally carry data in units referred 
to as "packets" which arc destined for many dififcrent 
sources. Addressing and packet typing are included in most 
standardized and proprietary packet based networking pro- 
tocols which make use of destination address fields at the 
beginning of and/or within each data packet for the purpose 
of distinguishing proper recipient(s) of the data of the 
packets. As a packet is received at intermediate and end ^ 
components in a system, rapid determination of the proper 
recipient (s) for the data must be made in order to efficiently 
accept, forward, or discard the data packet Such determi- 
nations are made based upon the above discussed address, 
packet type and/or other fields within the relevant packets. ^ 
These determinations can be made by network controller 
hardware alone^ by a combination of h^ware and software, 
or by software alone. In broadcast type netwoiks, every node 
is responsible for examining every packet and accepting 
those "of interest", while rejecting all others. This is called ^ 
**packet fSliering". Accuracy, speed and economy of the 
filtering mechanism are all of importance. 

When the above discussed determinations are made 
through a combination of hardware and software, the hard- 
ware is said to have accomplished a **partial filtering" of the 
incoming packet stream. It should be noted that one type of 
packet filtering is accomplished on the basis of packet error 
characteristics such as collision fragments known as "runts", 
frame check sequence errors, and the like. The type of ^ 
filtering relevant to the present discussion is based upon 
packet filtering in which filtering criteria can be expressed as 
simple Boolean functions of fields within the packet as 
(^posed to filtering based upon detection of errors or 
improperly formed packets. 

In the simplest case, each node of a computer network 
must capture those packets whose destination address field 
matches the node's unique address. However there fre- 
quently occur situations in which additional packets are also 
of interest One example occurs when the node belongs to a 5q 
predefined set of nodes all of which simultaneously receive 
certain specific ^'groupcast" packets which are addressed to 
that group. Groupcast packets are usually identified by some 
variation of the address field of the packet. Groupcast 
address types generally fall imo one of two forms. "Broad- 55 
cast" addresses are intended for all nodes and ''multicast'* 
addresses are targeted for specific ^plications to which 
subsets of nodes are registered. Another case of such field- 
based packet filtering occurs when certain network manage- 
ment nodes are adapted to focus on specific protocols, 60 
inter-node transactions, or the like, to the exclusion of all 
other traffic. 

Attachment of a networked device to the network is 
realized through a "controller" which operates indepen- 
dently of the host processor. Packet filtering then occurs in 65 
two successive stages beginning at the controller, which 
examines packets in real-time. To accomplish this, the 



controller is "conditioned** with an appropriate subset of the 
specified filtering criteria, according to the filtering capa- 
bilities of that controller. Hie controller classifies packets 
into three categories: Those not satisfying the filter criteria 
(**rejecu"); those satisfying the criteria ("exact matches'*); 
and those possibly satis^ing the criteria ('^partial matches**). 
Rejects are not delivered to the processor. Those packets 
which are classified as exact or as possible matches are 
delivered, with appropriate indications of their classifica- 
tion, to the device processor. The controller, ideally, 
excludes as many unwanted packets as its capabilities will 
allow, and the host processor (with the appropriate software 
operating therein) completes the overall filtering operation, 
as required. The value of filtering packets at the controller 
level (the partial filtering) is that it reduces the burden on the 
host processor. 

Controller filtering implementations are constrained by 
the fact that they must process packets in real-time with 
packet reception. This places a high value on filtering 
mechanisms that can be implemented with a minimum 
amount of logic and memory. Controller based filtering 
criteria are contained in a target memory. In the case of exact 
matching, a literal list of desired targets is stored in the target 
memory. While exact matching provides essentially perfect 
filtering, it can be used in applications wherein there are only 
a very small number of targets. 

Partial filtering is employed when the potential number of 
targets is relatively large, such as is often the case in 
multicast applications. A primary consideration is the "effi- 
ciency** of the partial filter. Efficiency (E), in this context, 
may be expressed as: 

E=Tn/Pn 

where: 

Tn=4he number of target packets of interest; and 
Pn==the number of potential candidates delivered to the 
processor. 

An efBdency of E=1.0 lepresents an exaa filtering effi- 
ciency wherein every candidate is a desired targeL This is 
the efficiency of the filtering which occurs in the "exact 
matching** previously discussed herein. While exact filtering 
efiSciency is an objective, the previously mentioned con- 
straints, including that the controller must do its filtering in 
essentially real-time, will generally not allow for such 
efficiency. 

The predominam method used in the prior art for partial 
packet filtering is "hashing**. The process conventionally 
begins with the extraction from each received packet of all 
fields involved in the specified filtcrir^g critoia. The com- 
posite of such relevant fields is called the "candidate field**. 
Assurrung an even distribution of candidate fields (a situa- 
tion that is not always literally accurate, but the assumption 
of which is useful for purposes of analysis)^ there will be a 
potential number of packet candidates of 2^ where Cb is the 
number of bits in the candidate field. The hashing function 
produces a reduction in the bit size of the candidate field 
according to a "hashing function**. As a part of the initiation 
of the controller, the bashing function is applied to each field 
of the target memory to assign a "target hash value** to each 
such field. The controller memory is initialized as a bit mask 
representing the set of target hash values. Then, during 
operation, a "candidate hash value** is created by applying 
the hashing function to each candidate field. The candidate 
hash value is used as a bit index into the controller memory, 
with a match indicating a possible candidate. 

As can be appreciated in light of the above discussion and 
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from a general understanding of simple hashing operations, 
the hashing function has the effect of partitioning the 2^^ 
candidate possibilities into Mb groups (called "buckets"), 
where Mb is the number of bits in the controller's target 
memory. Because candidate packets tteit fall into the same 
bucket are not distinguished, a "hit" represents any of 
2*^/Mb candidates. Useful hashing functions will partition 
the candidate possibilities in a roughly uniform distribution 
across the set of Mb buckeu. For a single target, the 
efficiency of such a hashing method is Mh/2 If Tn desired 
targets are represented by Bn buckets (where Bn<=Tn and 
Bn<=Mb, the efBciency of such a hashing method is: 

In exact matching* target n«mory could hold Mb/Cb 
targets. Hashing is appropriate when the number of buckets 
(Bn) is larger than this figure. However, effective hashing 
also requires that the number of buckets be less than Mb. 
because as target memory density increases there is less 
differentiation among cauididate fields. M^th the target 
memory full of hash targets, Bn=Mb and the efficiency is 
Tn/2". 

As can be appreciated, the described prior art hashing 
method used for partial packet filtering implies a loss of 
information in that a single hash value potentially represents 
a large set of candidates. Clearly, it would be desirable to 
reduce such loss of data. Correspondingly, it would desirable 
to maximize the filtering efBciency for a given Mb or (or to 
minimize the Mb for a given filter efficiency). 

To the inventor's knowledge, no prior art method for 
partial packet filtering has improved efficiency or reduced 
data loss as compared to the conventional hashing method 
described above. 

DISCLOSURE OF INVENTION 

Accordingly, it is an object of the present invention to 
provide a method and means for efficiently performing a 
partial filtering operation on data packets in a computer 
network. 

It is another object of the present invention to provide a 
method and means for partial packet filtering which rejects 
a maximum number of incoming packets which arc not at 
interest without requiring a large target memory and without 
unduly slowing down the processing of incoming packets. 

It is still another object of the present invention to provide 
a partial packet filtering method and means which is inex- 
pensive to implement. 

It is yet another object of the present invention to provide 
a partial packet filtering method and means which will 
operate in real-lime or near real-time. 

It is still another object of the present invention to provide 
a partial packet filtering method and means which is adapt- 
able to a variety of network system requirements. 

Briefly, the preferred embodiment of the present invention 
implements multiple independent hashing functions applied 
in parallel to_diC:ff ndida te faldo j^each packe t The com- 
bined applirfltion t^mmuple m^c ^ident hashinR^ &mctions 
results in si)ecification^f .ajias hfmatrix, with each coordi- 
^Ste of Jhe-hasb^na trix being the result of one of the ha shing 
^mctigns. The hash matrix includes the results of different 
hashing algorithms applied to a single candidate field, or the 
same hashing function applied to different subsets of the 
candidate field, or a combination thereof. The filter param- 
eters consist of the set of acceptable result values for each 
hashing operation. 
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An advantage of the present invention is that partial 
padcct filtering efficiency is improved, thereby freeing the 
host processor from a substantial portion of the packet 
filtering operation. 

Yet another advantage of the present invention is that 
filtering efficiency is increased geometrically with an 
increase in target memory. 

Still another advantage of the present invention is that a 
minimum amount of target memory is required for a specific 
target efficiency. 

Yet another advantage of the present invention is that the 
partial packet filtering can be performed in a minimum 
amount of time for a given target efficiency. 

These and other objects and advantages of the present 
invention will become clear to those skilled in the art in view 
of the description of the best presentiy known modes of 
carrying out the invention and the industrial applicability of 
the preferred embodiments as described herein and as illus- 
trated in the several figures of the drawing. 

BRIEF DESCRIPTION OF THE DRAWING 

FIG. 1 is a block diagram depicting a portion of a 
computer network with an improved partial packet filter 
according to the present invention in place therein; 

FIG. 2 is a diagrammatic representation of a conventional 
prior an Ethernet data packet; 

FIG. 3 is diagranunatic representation of a hash table; 

FIG. 4 is a flow chart showing a conventional prior art 
partial packet filtering operation; 

FIG. 5 is a block depiction of a partial packet filtering 
method according to the present invention; 

PIG. 6 is a flow chart, similar to the chart of FIG. 4, 
depicting the packet |nx)cessing operation series of FIG. 5; 
and 

HG. 7 is a flow chart depicting the preliminary operation 
series of FIG. 5. 



BEST MODE FOR CARRYING OUT 
INVENTION 

The best presentiy known mode for carrying out the 
tnvration is a partial packet filter for implementation in a 
persona] computer resident Ethernet controller. The pre- 
dominant expected usage of the inventive improved packet 
filter is in the interconnection of computer devices, particu- 
larly in network enviroimienls where there are relatively few 
targets. 

The improved partial tracket filtS LPf die presentiy pre- 
<;Tn>vvW,^^ of the ptcaent inve ntion is illus trated in 
a block_f ^^« C^"" in Fir,. I an d is designated tiierein by die 
reference character 10. In the diagram of FIG. 1, the 
improved partial packet filter 10 is shown configured as part 
of a network system 12 (only a portion of which is shown in 
the view of FIG. 1). In many respects, die best presently 
known cmbodimem 10 of the present invention is stmctur- 
ally not unlike conventional partial packet filter mecha- 
nisms. Like prior art conventional partial packet filters, the 
best presentiy known embodiment 10 of the present inven- 
tion has a controller 14 with an associated target memory 16. 
In Uie example of FIG. 1, tiie improved partial packet filter 
10 receives data from a network node 18 and performs the 
inventive improved packet filtering process on such data 
before passing selected portions of the data on to a host 
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pTDces^Jg.ta-which-the-improved*parti^^ filter 10 

FIG. 2 is a diagrammatic representation of a conventional 
Ethernet data packet 210. The standardized Ethernet packet 
210 has a preamble 212 which is 64 bits in length, a ^ 
destination address 214 which is 48 bits in length, a source 
address 216 which is 48 bits in length, a lengthAype field 218 
which is 16 bits in length and a data field 220 which is 
variable in length from a mininuiin of 46 eight bit bytes to 
a maximum of 1500 bytes. Following the data field 220 in 
the packet 210 is a 4 byte (32 bit) frame sequence check 
("FCS") 222. The packet 210 is transmitted serially begin- 
ning at a "head" 224 and ending at a 'tail" 226 thereof. The 
preamble 212, destination address 214, source address 216 
and length/type field 218 are collectively referred to as the 
header 219. 

FIG. 3 is a diagraminatic representation of a conventional 
single dimensional hash table 310 with which one skilled in 
the ait will be familiar. The hash table 310 has^a.pluraiity of 
address locations 312 each of which can be'*^et**^(set to 1) ^ 
or left^i mset (set to z ero). ^^^^ 

FIG. 4 is a flow diagram depicting the operation of a 
conventional prior art partial padcet filtering ope ration 4 10. 
As previousl y discussed briefl y^ pac ket 21 0r OTG^ 2^ is ^ 
received (receive pa^et^peration 412) firam the network 18 
(FIG, 1) andTcandjdate^field 413 (such as the header 219 
of the packet 210) is cexg5^ r(cxtract candidate field 
operation 414). A hashing operation 416 is performed on the 
extracted candidate field 4 13 tD_produce a hash value 417 
and t he hash vahic 417 is comofflrcd toT B^ 
(FIG. 3) s t<»cd in the ta rgeL_mcmoiy^ life (HQ. 1) in a 
comparisqn7Dp gauon~4 lirtrtKriSDrof the comparison 
operation 418ls a match, the packet 210 is forwarded in a 
forward packet operation 420. If the result of the comparison 
opCTation''41g^sy6t^am Sch, the packet 210 is rej ected 422 
in ajS^railSLop^adSJn, It should be remembered that 
the use of die header 219 here is an example only, and any 
portion or combined portions of the packet 210 might 
constitute the candidate field 413 in a given application. ^ 

FIG. 5 is a flow diagram depicting the inventive improved 
packa filtering process 510. The improved packet filtering 
process 510 is accomplished in a preliminary operation 
series 512 and a packet processing operati on 514, each of 
which-is>repcatcd-as-rcquiiBd. ,as will be discussed h CTih- 45 
a fter. T he preliminary operation scri^es51 2 is aooo mpKshed 
ac^gn gto software resi djttgrjaJto ejMtprop M 
VTvo configure theC ^rget meniqq^j :6 (FIG 1) as will be 
discussed heretnafterlt should benoted that the fact that the 
i mproved packet fjlteri n B^process 510 is^d iaided iniaihe two 
main operation cate gories (the preliminary operation series 
512 aiul the packet processing operation 514) does not 
distinguish this invention over the prior art Rather, the 
processes within^ the pr^imiiian^operaljon-acric ^_512 and 
theji^ey)rocessing.operalion.514 describe^the essence of 55 
the inventive process . 

FIG. 6 is a flow chart showing the inventive packet 
processing operstian 514 in a manner analogous to the 
]»esentation of the prior art partial packet filtering operation 
410 depicted in FIG. 4, As can be seen in the view of FIG. 60 
6, the packet processing operation series 514 is similar in 
many respects to the prior art partial packet filtering process 
410 (FIG. 4). IiLt he packet processin g o pera tion scries 514, 
a packet 210 (FIG. 2) is received (receive packet operation 
412)^d-a j:arrdi"dge !IfieltL413 is extracted in an extract 65 
candidate field opcr atioiu414.-J D the best presentl y known 
e mbodiment I jLoL-tbe^ present invc ntion,-the.,inycntive 



6 

packet processing operation s^es 514 next performs a 
candidate field reduction operation 626. In the best presentiy 
known cmbo^incntJft^ofjhCLpjcsentani^eT^ the candi- 
date jeld-redu ction operation_626Js merely the a pplication 
of the^ conventionaljCRC-PolYOomial al g orithm to the ca n- 
didatcJ&cldJ13ao4tield,a.32..hitjatC^oi^i^^ 
(although^yjofjinumber of sumlar alg orithms might be 
applied for this pmpbse)rNext^ a subset selection operation 
630 selects a predetermined number (two in the example of 
FIG. 6) of bit-wise.s ubsetsj 636 from the CRC o utput value 
628. Tlie jneth od^cdctermiriingLihe-quantity^oL bit-wise 
s ubset3_0 6 to be selected in the subset selection operation 
630, a nd the s izcoJ Uach, will be discussed heieinaftg : In the 
best presentiy^known ,emb(odime nt^l0.of_the.present inven- 
tion, the bit-j>vise ,subscts^636^ate«eacli.6 bite in le ngth. It 
sho»dd^bejigted.th^jLnjhe^besi.pc 
ment^lO^of, the present i nvmtion , the_bit-wis^su|^e5!636 
ar c selected from the CTC output value" ISM 'Smpiy by 
taking die first(1S']KtB of the CRC output value 628, the 
second six Jits,jmdsocm until as many bit-wise subsets as 
are ceedeci are obtained and so, in die be^ pxsentiy known 
embodiment 10 of the present invention, the bit wise subset 
636 are "consecutive bit section of the fixed size field (the 
CRC output value 628 in the best presentiy known embodi- 
ment 10 of the presem invention. The inventors have deter- 
mined that the bits of the CRC output value 628 (resulting 
from the CRC polynomial function) are independent of each 
other, and so any 6 bit portion of the CRC output value 628 
is as representative of the CRC output value 628 as is any 
other 6 bit portioa 

The bit-wise subsets 636 are then compared to the hash 
table 310 (FIG. 3) stored in the target memory 16 (FIG. 1) 
in a comparison operation 642. The combined multiple hash 
values 636 may be considered to be a hash matrix 638 (in the 
example of FIG. 6, a two dimensional hash matrix 638). 

It is important to note that the essence of the present 
inventive method lies in the extraction of the plurality of 
independent or relatively iiMlcpcndcnt representative indices 
of the candidate field 413 ("candidate filled indices") which, 
in die example of the best presentiy known embodiment 10 
of the present invention are the bit-wise subsets 636 which 
make up the hash matrix 638. That is, die bit- wise subsets 
636 are representative fields in that die bit-wise subsets 636 
are representative of die candidate field 413, as discussed 
above. The generally simultaneous (parallel) processing of 
these is the source of the advantages of the present inventive 
method and means. The exact method described herein in 
relation to the best presentiy known embodiment 10 of tiic 
present invention, that of first reducing the candidate field 
413 in the candidate field redugUon operat ion 626 and then 
e xtracting the trit-wise subscts63&~ir"but one^f inan y 
pote ntial methods for accomplishing such aj>arallel has hing 
o peration 639, and the present invention J s-not intende d to 
bgJiHat gd by this aspect of the best presenti v^known 
embodiment 10. 

In the best presentiy known embodiinem 10 of the present 
invention, in a comparison operation 642, each of the 
bit-wise subset s.636 iscoinp .aicd4o-a-refeFence-hash,iable 
644 (a/target^sb^T^') stored in the target meinory^6^ 
(FIG. l XBigdno nj y4Ul^^ 

in a pack^ fpiwdinfcjaperation 646. In Uic example of FIG. 
6, the reference hash table 644 will be a 64 element array 
representing all values from 0 through 63 inclusive. Some 
elements of the reference hash table 644 are set as will be 
discussed hereinafter in relation to the preliminary operation 
series 512. If the value of the bit-wise subset ^*faUs into one 
of the buckets" (is equivalent to a corresponding set bit in 
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the reference hash table 644), then the data packet 210 is 
defined as being a '*match". 

Now returning to a ronsideration of th e^prelirninary 
operatiOT series 512 (FIG. 5) with an understanding of the 
packet processing operation series 514, the target memory 
16 is configuied^ipTocMS_stci»much Hke those described 
in relation to tl»~pcker^occssing operation series 514 of 
HG. 6. 

FIG. 7 is a flow diagram of the preliminary operation 
series 512 according to the best presently known cmbodi- 
mem 10 of the present invention. A prdiminary operation 
which is common to both the prior art and the present 
invention is a taiget field(s) selection process 712. The target 
(field) 5 selection process is merely the selection of criteria 
to which incoming packets 210 are to be compared. For 
example, if the entire process is to be on the basis of desired 
destinations, then an intended destination address 214 (FIG. 
2) will be (one of) the taiget fiekl(5) 714, and if three 
destinations are of interest, then.theiejviU^beJ^nK^ 
fields J7X4_as.iUustratcd-in-ihe.example5ZHS--7.Jlij£.ae^ 
pwvrggtt mvnived in selecriiij ^e tarpet field(s) is a fu nction 
QfnetwoA^OTninjljMifiLwajrejKhich-^^ art 
^§g(51smyie^le:tanr^^ to 
ih^xtent_^^JtJicUwsJh^ 
invendvc_prriinun^_opc^^ 

Having detcnmned thejj uantity (^j argei fields 714 of 
ir^terest, hos tsoftware will next det ennmea bit-wise su bset 
quafetQC7l61(the appropriate "suGsetrqumSty" of bit-wise 
subset 636) in a bit-wise subset quantity determination 
operation 718. The bit-wise subset quantity determination 
operation 718 will be discussed in more detail hereinafter, as 
it can be better understood in light of the present description 
of the entire preliminary operation scries 512. For the 
present simpUfied example of FIGS. 6 and 7, and as already 
mentioned, the bit-wise subset quantity 716 is two. Thai is, 35 
two of the bit- wise subsets 636 arc to be extracted from the 
CRC output value 628 in the subset selection operation 630 
of HG. 6. 

As can be appreciated, the target fields 714 arc each 
equivalent in form to the candidate fields 413 discussed 40 
previously herein, and processing of the target fields 714 is 
much the same as has been previously described herein in 
relation to the candidate fields 413. In the inventive prelimi- 
nary operation series 512. each of the target fields 714 is 
processed in a target field reduction operation 726 by 43 
application of the CRC polynomial to produce a target CRC 
value 728. Each of the target CRC values 728 is then 
processed in a target subset selection operation 730 to 
produce a plurality (two for each target CRC value 728 for 
a total of six, in the present example) of target bit-wise 50 
subsets 736^bu35DnLgenmLterms^xach.of 
714 (havingbSnsderted according to prior art methods as 
discussed previously, herein) is processed as describ^ Lto 
l^oduc£a_;n^g^jcpcesemaiiAiEe-8^'* (the targa CRC value 
7m iTi the prejient example^, w hich is theniiirther o n oc MS^ 55 

in ^^s may,be '*taiBet strine^subsets38^orthc target rcpre - 
S CTtadve field and whi ch arc, in the present examp llc the 
t ^et bit-wise subsets V36. ihis process is alike to the 
process which is repeated as necessary to process each 
incoming data packet 210 . wherein the candidate fields 413 
are piocessed to produce a candidate representative field (the 
CRC output valxie 628 in the present example), which is 
further processed to produce the "can didate strin gsubsets'* 
(the bit-wise subsets 636 in the present cxample)rThe 65 
quantity of target bit- wise subsets 736 taken from each target 
CRC vaJu^M^ also the bit- wise subset quantity 716 (two . 



in the present example). It should be noted that a target 
parallel hashing operation 739 is like the previously 
described parallel hashing operation 639 in that the inven- 
tion might be practiced with variations of the specific steps 
therein which arc presented here as features of the best 
presently known embodiment 10 of the present invention. 

In a tarcet me mory setting operftti o" 740 the reference 
hggCSlrifi 6 44 ''^ font^ptted sun h jha!, P4ic h memory lyatio n 
312 <;iQiie&pfmdin fi^&-a-valuejaf^anv oLthe target bit-w ise 
^lite fJ?^^*^-^-^^ ^ example, if the first target bit^wise 

...I . fi^^ **nnnmn" Mf^fWrnnl vqIha thpn lHi> third 



subset 736a were **0q0010" (decimal value 2) then the third 
memory location 312^ the reference bash table 644 would 
be set to.*lL*\ as i« illustrated in FIG. 7. As can be a|ipreciaied 
from the above discussion, the maximum number of 
memory locations 312 in the reference hash table 644 which 
can be set by this process is the quantity of target bit- wise 
subsets 736 (six, in the present example). However, since 
two or more of the target bit-wise subsets might coinciden- 
tally hash to the same value, a lesser quantity of memory 
locations 312 might also be set. 

Now returning to a more detailed discussion of the 
bit-wise subset quantity determination operation 718, the 
target memory 16 is to be ccMjfigured to maximize the 
effectiveness of the filtering based on the quantity of mul- 
ticast packets 210 of interest to the software of the host 
processor Therefore, the bit-wise subset quantity determi- 
nation operation 718 attempts to determine (or. at least, to 
approximate) an op timal nu mber of jng^pCT pgdcet (and. 
lltta,jheJnuwise^bseLquanti^T16-discussei^ 
herein^ . The '^optimal" number here means t hat jyluch will 
iniramiz^the_numbcr_o£-l'uriintcrBSting" packets which 
match the set data bits 312 in the reference hash table 644 
while matching all of the ^Hnteresting" packets 210. In the 
best presently known embodiment 10 of the present inven- 
tion, the following table is used to determine the bit-wise 
subset quantity 716. 



TABl£ OF SUBSET QUANTmES 


Addresses of 


Number of Hash Indices 


Interest 


Bit-^ise Subset Quandty 716 


1-2 


5 


3 


4 


4-9 


3 


10-16 


2 


17 or more 


1 



60 



The above Ublc is oflFercd here as a guide only, in that the 
"optimal" number of selected hash indices may vary in ways 
not presenUy contemplated Furthermore, it should be noted 
that the above table is based upon an assumption that none 
of the target indices (the taiget bitwise subsets 736 in the 
best presently known embodiment 10 of the present inven- 
tion hash u> the same memory locations 312 in the reference 
hash table 644. If, indeed, two or more of the taiget bit-wise 
subsets 736 did hash to the same memory location 312, then 
additional hash indices could be added to increase efficiency 
without sacrificing speed or requiring additional memory or 
processing. 

It should be noted that while the packet processing 
operation scries 514 is accomplished in the hardware of the 
best presently known embodiment 10 of the present inven- 
tion, the preliminary operation series (which can be accom- 
plished at a more leisurely pace) is performed primarily by 
software of the host processor 20. As can be appreciated in 
light of the above discussion, the preliminary operation 
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series will be repealed when the network 12 is reconfigured, 
when it is desired to communicale with additional members 
of the network 12, or upon other occasions according to the 
needs of the user and the network 12. The packet processing 
operadon series 514 will be repeated whenever an incoming s 
packet is detected from the network node 18. 

It should also be noted that, while the best presently 
known embodiment 10 of the present invention hashes each 
of the CRC values 628 and 728 to a common reference bash 
tabic 644, the invention might be pracdced with equal 1^ 
efficiency by hashing each of the CRC values 628 and 728 
to its own individual hash table (not shown). Using the 
quantities of the example of FIGS. 6 and 7. each of the 
individual hash tables would be 32 bits (memory locations 
312) large (one half of 64 bits, since it must be divided 
between the two target CRC values 728). The individual 
bit-wise subsets 636 and 736 would then be S bits long 
(decimal value 0 through 31). 

Various modifications rmiy be made to the inventive 
in^ffoved packet filter 10 without altering its value or scope. 
For example, the quantity, size, and derivation of the plu- 
rality of bit-wise subsets 636 and 738 could readily be 
revised according to the parameters discussed herein. 

All of the above are only some of the examples of ^ 
available embodiments of the present invention. Those 
skilled in the art will readily observe that numerous other 
modifications and alterations may be made without depart- 
ing from the spirit and scope of the inventioa Accordingly, 
the above disclosure is not intended as limiting and the 
appended claims arc to be interpreted as encompassing the 
entire scope of the invention. 

INDUSTRIAL APPLICABILITY 

35 

The improved partial packet filter 10 is adapted to be 
widely used in computer netw(»rk communications. The 
predominant current usages are for the interconnection of 
computers and computer peripheral devices within networks 
and for the interconnection of several computer networks. 40 

The improved partial packet filters 10 of the present 
invention may be utilized in any application wherein con- 
ventional computer interconnection devices are used. A 
significani area of improvement is in the inclusion of the 
parallel processing of a plurality of indices (bit-wise subsets 
636) of a packet 

The efficiency of the filtering provided by the improved 
partial packet filter 10 is significantiy improved, particularly 
for cases where the number of targets is small relative to the 
numbo- of '^buckets" (memory locations 312). To 
the efficiency of the present inventive improved packet 
filtering process 510 embodied in the improved partial 
packet filter 10 with the prior art partial packet filtering 
process 410, assume, for example, the following values: 

Mb=64 (representing 64 memory locations 312 in the 
reference hash table 644) 

Cb=48 (representing a 48 bit candidate field 413 size — a 
typical size of die destination address 214 

Dit=4 (representing a bit- wise subset quantity 716 of four) 60 

Then, the prior art partial packet filtering process 410 will 
partition the 2*^ possibilities among 64 distinct buckets, one 
of which matches the bucket into which the single target 
falls. In the improved packet filtering process 510, the four 
parallel hashing functions partition among 16 possible buck- 63 
cts each. The efficiency (EQ for the prior art partial packet 
filtering process 410 would then be: 
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The efficiency (Ef4) for this example of the improved 
packet filtering process 510 is: 

The efficiency Ef4 is better than the efficiency Ef by a 
factor of 2'** (1024), which is to say that only a thousandth 
as many (uninteresting) packets will be delivered to the next 
stage of filtering using the inventive improved partial packet 
filter 10 as compared to the prior an. 

Filtering of packeu may be accomplished Uirough a 
combination of exaa and partial match filters. Typically, one 
or more partial filterings will occur first, with the multiple 
dimensions of each filtering accomplished in parallel with 
each other (according to the present invention). Packets 
which pass through the inventive improved partial packet 
filter 10 may then be filtered using an exact match filter 
technique, such as "binary search lookup" of the filter data 
in a sorted table of acceptable filter data values. Further- 
more, results of partial filtering can be used to determine 
which of many (possibly sorted) tables in which to search for 
the packet. 

Accordingly, the inventive improved packet filtering pro- 
cess 510 may be applied more than once to each incoming 
packet 210 (in a first stage and a second stage). In such an 
example, configuration of die first stage partial filtering 
would involve specification of the number and type of 
hashing operations to be performed, along with the portion 
of the packet which is to comprise the filter data for each 
such operation, along with acceptable resuhs for each. 
Multiple partial filterings may be configured with the speci- 
fication including the logical relation to apply to the results 
of each filtering. For example, partial filtering A might be to 
apply the 32 bit CRC polynomial to the destination address 
field of an Ethernet padcet, and retain die loWcst order 3 
bits — a value firam 0 to 7. Partial filtering B might be to 
^ply the 32 bit CRC polynomial to the source address field 
of the Ethernet packet, and retain the lowest order 3 bits. The 
logical relation might be to accept packets only for which the 
results of the first filtering (A) is either 2 or 4, and the result 
of the second filtering (B) is either a 3 or a 4. In a general 
case, one may expect die likelihood of arbitrarily filter data 
to "pass" the first filtering to be 2 in 8 {25%), since 2 of the 
8 values from 0 to 7 are acceptable. Similarly, the likelihood 
of die second filtering "passing" such a filter is 2 in 8 (25%). 
Assuming that the two filterings are, as desired, truly inde- 
pendent, dte likelihood of this arbitraiy packet being 
accepted is the product of these, or 1 in 16. Note further that 
the specification of these "accq)table result sets" ({2,4} for 
A and {3,4} for B) requires 16 bits of information for full 
specification, where 8 bits indicate the acceptability/unac- 
ceptability of each of the 8 possible values of filtering A, and 
8 additional bits indicate the acceptability/unacceptability of 
each of the 8 possible values of filtering B. Use of such 
multiple partial filterings may be especially effective in 
situations where filtering criteria are derived firom indepen- 
dent portions of the filter data, such as filtering for all 
packets whose destination address OR whose source address 
is within a set of interesting addresses AND whose packet 
type indicates a particular protocol of interest 

Since die improved partial packet filters of the present 
invention may be readily constructed and are compatible 
with existing computer equipment it is expected Uiai they 
will be acceptable in the industry as substitutes for conven- 
tional means and methods presenUy employed for partial 
packet filtering. For these and other reasons, it is expected 
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that the utility and industrial applicaWliiy of the invention 
will be both significant in scope and long-lasting in duration. 
We claim: 

1. A method for selectively forwarding a data packet and 
controlling the distribution of data packets in a computer 5 
network system, the data packet having a candidate field 
containing information about the data packet, the method 
comprising: 

configuring a target memory of a controller to contain a 
target hash array in steps including; ^° 
aa determining a target field and extracting a plurality 

of target indices from said target field, the target 

indices being a binary number having a value; 
ab setting memory locations in the target memory 

corresponding to the value of each of the target 15 

indices; and 
processing the data packet in steps including: 
ba extracting the candidate field from the data packet; 
bb extracting from the candidate field a plurality of ^ 

candidate field indices; 
be comparing the values of each of the candidate field 

indices to the target hash array; and 
bd forwarding the padcet when each of the values of 

each of the candidate field indices corresponds to a 

memory location of the target hash array which was 

set in step ab. 

2. The method of claim I, wherein: 

step aa is accomplished in substeps including: 

aal reducing the target fields to a plurality of target ^ 

representative fields; and 
aa2 selecting one or more target string subsets from the 

target representative field; and 
step bb is accomplished in substeps including: 
bbl reducing the candidate field to a plurality of 33 

candidate rcptesentative fields; and 
bb2 selecting one or more candidate string subsets from 

the target representative field. 

3. The method of claim 1, wherein: 

step ab is accomplished by causing only those memory 40 
locations in the target memory which correspond to the 
value of each of the target string subsets to contain a 
value of one. 

4. The method of claim 2, wherein: 

step aal is accomplished by applying a cyclic redundancy 
check algorithm to each of the target fields; and 

step bbl is accomplished by applying the same cyclic 
redundancy check algorithm to the candidate field. 

5. The method of claim 2, wherein: ^ 
in step aa2 the target string subsets are selected by 

extracting a plurality of target bit-wise subsets from the 
target representative field; and 
in step bb2 the carxlidate string subsets are selected by 
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extracting a plurality of candidate bit-wise subsets from 
the representative candidate field. 

6. The method of claim 2, and further including: 

an additional process step preceding step ab wherein a 
subset quantity is determined, the subset quantity being 
the number of target string subsets to be extracted from 
each of the target representative fields and also the 
number of candidate string subsets to be extracted from 
each of the candidate representative fields. 

7. The method of claim 6, wherein: 

the additional process step is accomplished, at least ini- 
tially^ by selecting the subset quantity from a table of 
subset quantities. 

8. The method of claim 2, wherein: 

each of the target representative target and the candidate 
representative field are 32 bits in length. 

9. The method of claim 1, wherein: 

steps aa through ab are repeated when a change in the 
distribution of data packets is de^red. 

10. The method of claim 1, wherein: 

steps ba through bd are repeated for each incoming data 
packet 

11. The method of claim 1, and further including: 

an additional process step preceding step ab wherein a 
subset quantity is determined, the subset quantity being 
the number of target indices to be extracted from each 
of the target fields and also the number of candidate 
indices to be extracted from each of the candidate 
fields.' 

12. The method of claim 11, wherein: 

the additional process step is accomplished, at least ini- 
tially, by selecting the subset quantity from a table of 
subset quantities appropriate to a quantity of target 
quantities. 

13. The method of claim 1, wherein: 

the candidate field inchidcs a target address field of the 
data packet 

14. The method of claim 1, wherein: 

the data packet is a standardized Ethernet data packet 

15. The method of claim 1, wherein: 

the target hash array is an unapportioned array such that 
each of the target indices is used to set memory 
locations in that unapportioned array. 

16. The method of claim 1, wherem: 

the target hash array is apportioned such that at least some 
of the target indices are directed to different portions of 
the target hash array. 

17. The method of claim 1, wherein: 

the target indices and the candidate indices are each a 
binary string of fixed bit length. 

» « « « * 
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ABSTRACT 



The present invention encompasses a method of storing 
ternary data that includes the steps of (1) initializing a 
conversion register by storing binary-to-temary mask data in 
a conversion register; (2) storing, ternary-data -in-a content 
addressable.memory (CAM) by inputting^a single^bit binary 
data to the conversion register, and converting the binary 
data into two bits of ternary data using the conversion 
register; and (3) simultaneously storing the two bits of 
ternary data in first and second memory cells. For subse- 
quent searching, the method further includes the steps of 
scarchiQ£_ for a match of input search binary data t o the 
stored conten ts of the CAM: providing ,a match vaMd ou tput 
resp onsive to _thc-inp ut search binarv bits ro at ching_aa Y of 
th e^stored contepts;.aGd g enerating an address corresp onding 
t o a location_inJhe CAM where the match is found. 

21 Claims, 9 Drawing Sheets 



COUBINED 
NEW HEADER 
ENCRYPTED DATA 



805^ 



PAYLOAD 
DATA 



EHCRYPTION • 
PARAMETERS ■ 



KEYS 



ML md 



mini & HEADER 



815 



P.T. 
CAM 
CHIP 



ENCRYPTOR | 

r 



745 -> 



EMCRYPTEO 
DATA 



755 



740 



750 



CLOCK 
OP-CODE 
MASK SELECT 



721 



720 



PROCESSOR 



CAM DATA 
(VPI/VCI) 
ADDRESS 

730 



701 



723' 



LOOK-UP 
TABLE 



17 



-755 



COMBINE 



HEADER 



I J 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent 



Nov. 24, 1998 



Sheet 1 of 9 



5,841,874 



MASK 
SELECT CODE 

DATA INPUT 

OPCODE 
INPUT 

CAM SEARCH 



CLOCK 



105 



± 



MASK WRITE 



DECODE 
—} 

no 



< 1 



CAH^ 
WRITE 



160 



100 



140 
> 



BINARY-TO-TERNARY 
CONVERTER 



B 



MEMORY CELL 1 



120 



130 
{ 



MEMORY CELL 2 



150 
S 



COMPARATOR 
(XOR) 



I 1 



170 



MATCH 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent 



Nov. 24, 1998 Sheet 2 of 9 



5,841,874 



120-^ 



130^ 



150-^ 



210 



221 



He 



222 



RA 



223- 



3V 



230 



RB 



231 



Jh 



J. 



i_rp_J7. 



B 



234 



r 



^253 ^254 



GND 



Ji5 

^ , CAM 

WRITE 



VDD 



GND 



MATCH 



J 170 



FIG. 2 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent Nov. 24, 1998 Sheet 3 of 9 5,841,874 



CLOCK 
105 



1 



PC 



WRITE "0" 



WRITE "1" 



RA 



RB 



WRITE 



WRITE CYCLE 

WIG. 3 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent 



Nov. 24, 1998 



Sheet 4 of 9 



5,841,874 



CLOCK "[ 



PC \ 



SEARCH FOR "1 



PULL HATCH LINE HIGH 



SEARCH FOR "0" 



MATCH |3Cy 



LOGIC 1 "STORED" 



RA J 



RB 



CLOCK 
MULTI- 
MATCH 
BUFFER 



MATCH 



j L 



SEARCH CYCLE 



NO MATCH 



i L 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent Nov. 24, 1998 



Sheet 5 of 9 



5,841,874 



MULTI- 
MATCH 
BUFFER- 
REGISTER 




325 
ADDRESS 



ADDRESS 
GENERATOR 
(ENCODER) 



■320 



DECIMAL 
TO BINARY 
VALID ENCODER 
MATCH 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent 



Nov. 24, 1998 



Sheet 6 of 9 



5,841,874 



CASCADE 
INPUT 



OPCODE 



SYSTEM 



DATA 




INPUT 






INPUT 




INPUT 




A. 






A 
















1 1 

•«c 


:ELD_STAf 


LU 

1 

LU 

CO 2 
1 -< 

^ o 

CO 1 
-< Q_ 


>_ENABLE 


o 


IABLE_CH] 


LU 
CO 




»— f 
Li. 


2 O 


O 




LU 


uu 



CD 
CO 



. r: III 

> ^ CO CO oo 

I 9^ CO CO CO 

J u- LU LU LU 

I Q£ Q£ 

j 2 O 

: < ezk o o 

s Q£ "«c --c 



405- 

300' 

405' 
410- 
453- 
455' 
460- 



SELF-TESTER 



INPUT BUFFER 



MARK REGISTER 
BINARY-TO-TERNARY 
CONVERTER 



2K WORD 
X 

64 ELEMENT 
TERNARY CAM 



WORD FINDER 



RAM 



o 
I— 
< 
cr 

CL 

o 



MULTI-MATCH BUFFER 



MATCH SORTER 



ADDRESS ENCODER 



'420 



2K WORD 
X 

16 BIT RAM 



450 



CASCADE 
LOGIC 



470 



OUTPUT BUFFER 



I I 



Q_ Q_ Q_ 

HH I— ( »— I »— I 
0-0.0.0. 

I I I I 
°s ^ ^ 

LU OD 



>- O I— 



•< LU 

LU r^i 



X X X X 



Q I *^ I 

— • °^ 

UJ CO o 



' — V — ' 

DATA 
OUTPUT 



SYSTEM 
OUTPUT 



OPCODE 
OUTPUT 



I I I I I 

LU Q LU CO 

— J ^-• Q£ CO 

09 1^ -J O LU 

to > I Q 

^ I CO o 

I 52 <^ ^ 

I to LU 



CASCADE 
OUTPUT 



500 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent 



Nov. 24, 1998 



Sheet 7 of 9 



5,841,874 



2K BIT INPUT FROM 
MULTI-MATCH BUFFER 



510 



1 ••• 1 M i 1 



WIRED-OR RAM 



2K 

BIT RESULT 
FROM RAM 
COMPARATOR 



16 BIT 
RAM OUTPUT 
TO CASCADE 
LOGIC 



ADDRESS DISABLER 



530 



i 1 1 M 



520 



MULTI-MATCH DETECTOR 



1 I 



MULTI-MATCH 
INDICATOR TO 
CASCADE 
LOGIC 



SELECTOR 



2K BIT 
OUTPUT TO 
ADDRESS 
ENCODER 



y 



ADDRESS ENCODER 



540 



550 



11 BIT ADDRESS 
(OF WRITTEN/MATCHED/DETECTED WORD) 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent Nov. 24, 1998 Sheet 8 of 9 5,841,874 




AB S'^O 



FIG. 8 



06/25/2003, EAST Version: 1.03.0002 



U.S. Patent Nov. 24, 1998 Sheet 9 of 9 5,841,874 



COMBINED 
NEW HEADER k 
ENCRYPTED DATA 




80S 



800 



I/F 



900 



710 



PAYLOAD 
DATA 



VPI/VCI 4 HEADER 



ENCRYPTION 
PARAMETERS 



KEYS 



700- 




^825 



ENCRYPTOR 



745^ 



ENCRYPTED 
DATA 



755 



740 



750 



COMBINE 



815 

I 



CLOCK 
OP-CODE 
MASK SELECT 

7 

721 



720 
f 



PROCESSOR 



CAM DATA 
(VPI/VCI) 
ADDRESS 

730 



701 



723^ 



LOOK-UP 
TABLE 



^735 



HEADER 



06/25/2003, EAST Version: 1.03.0002 



5,8^ 

1 

TERNARY CAM MEMORY ARCHITECTURE 
AND METHODOLOGY 

FIELD OF THE INVENTION 

The present invention relates generally to semi-conductor 
memory, and more particularly, to a cascadable content 
addressable memory device, and a system using a plurality 
of such cascaded devices. 

BACKGROUND OF THE INVENTION 

Content addressable memory devices (CAMs) are 
extremely valuable in providing associative look-up based 
on contents of the data. By pre-loading a CAM with a 
pre-defined data set, including the data to be compared and 
optionally data to be output when a match is found for a 
corresponding CAM location, the address where the match 
is found can be output as an index to the requesting device, 
or both the address and data can be output for each match. 
One problem incurred in using CAMs is that the construc- 
tion of CAM chips requires multiples of the number of 
transistors needed to implement the CAM memory that a 
standard read/write random access memory (RAM) would 
require. Thus, CAM chips are usually much smaller in depth 
size than RAM chips. Therefore, the capacity of a single 
CAM chip is frequently inadequate to provide for the 
necessary associative l ook-ups . Accordingly, it becomes 
necessary to use multiple CAM chips in some sort of 
cascaded or interconnected manner to provide greater depth. 
Thus, while a single CAM chip, for example, might provide 
Ik words of 32 bits of CAM capacity, the system's require- 
ments may require 8k or 64k or even 1 megaword of CAM 
capacity. 

Prior system designs have attempted to resolve the depth 
capacity problem by cascading chips in a seriatim manner 
where each CAM performed a look up function, and if no 
match was found, then the next CAM in the cascade chain 
would attempt its look-up and so forth, looking until a match 
was found. A major problem with this approach is that there 
is a variable latency in this architecture, where the time 
taken to find a match is widely variable from associative 
look-up to associative look-up, due to the fact that there is 
uncertainty as to how many CAM chips in the chain will 
have to be accessed, one at a time in turn, until a match is 
found. CAM Data Input fines must be run in parallel to all 
of the chips in the cascade chain, and control logic and 
intcrcoupUng must be provided between the multiple chips 
in the cascade chain. 

Binary CAM architectures, which store 1 and 0 data, have 
been the predominant technology used for content addres- 
sable memory (CAM) applications because of their speed 
and density advantages over theoretical ternary CAM 
architectures, which store 1, 0, and don*t care "x" data. This 
is because each ternary CAM data element contains two 
memory storage elements, and where static RAM-type 
memory cells are used, which require data and complement 
data, two clock cycles are required to write both the data and 
mask memory elements for each of the ternary CAM cells. 

Therefore, a need exists for a ternary CAM memory 
which can operate in a single clock cycle. With no speed 
disadvantage over binary CAMs, ternary CAMs could be 
used for such things as address resolution, filtering, 
mapping, virtual LANs, Asynchronous Transfer Mode 
(ATM) communications and systems, etc. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 illustrates a memory cell in accordance with the 
present invention for a ternary CAM cell semiconductor; 
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FIG. 2 illustrates a fourteen transistor static ternary CAM 
cell, corresponding to the memory cell 100 of FIG. 1, 
illustrated in further detail; 

FIG. 3 illustrates the timing waveforms for the write 
5 cycles to the CAM memory, in accordance with the coding 
provided as illustrated by Table 1; 

FIG. 4 illustrates the operation of the memory system 
during a search cycle, corresponding to FIGS. 1 and 2 and 
Table 2; 

FIG. 5 illustrates a memory array in accordance with the 
present invention, comprised of multiple memory storage 
subsystems 100 of FIG. 1; 

FIG. 6 illustrates a functional block diagram of a CAM 
15 memory system in accordance with the present invention, 
and illustrates external inputs and outputs to the memory 
system; 

FIG. 7 illustrates a detailed diagram of the match sorter 
and address encoder of FIG. 6 in further detail; 
20 FIG- 8 illustrates a state flow chart of the operation of the 
CAM memory system in accordance with the present inven- 
tion; and 

FIG, 9 illustrates a system embodiment of the CAM 
memory system of the present invention in an ATM data 
encrypting embodiment. 

DETAILED DESCRIPTION OF A PREFERRED 
EMBODIMENT 

3Q In accordance wit h the, pr eseDt invention, a temarvC AM 
architecture is provided, comprising a binary-to-temary con- 
version subsystem for providing first and second ternary 
data outputs responsive to binary^data jnput, a memory f or 
si rnultaneously ^toring the first and second ternary outputs 

35 an d providing a content addressable memor y, and a com- 
parat or for providing a match output resronsive to the 
com parison of binary data curr cqtly being input for com- 
parison as compared to the stored first and second tern ary 
d ata outputs in the memo ry. In the preferred embodiment, 

^ there are a plurality of memory cells, forming a plurality of 
multiple bit storage words for storing the first and second 
ternary data outputs for a plurality of words. Furthermore, in 
the preferred embodiment, a maskjegister_subs>^tem is 
provided that is utilized in_^rformin g the binar y^jo-temary 

45 data conversion of data in put signals^ both for pur poses of 
st_oritig_tb e ternary data outputs in the memnry and fn r 
piupQses_of providing ternary dat a input f or co mparison to 
t he stored ternary data . In the pretcrrcd embodiment, a 
plurality of addressable mask registers provide for the 

5Q selection of any one of a plurality of binary-to-ternary 
conversion options, both for purposes of storage, and inde- 
pendently for purposes of input data conversion for purposes 
of comparison. 

In acc ordance with one aspect of the present inve ntion, a 

55 method of storin g ternary ^data is provided, com prising the 
st eps of ( 1 ) initializing a conv ersjoB-re^sfe f hy^ stnrin^ 
bi nary-to-temary mask d a ta in a conversion re gister; (2) 
storin gJemarv^ata in a content a^dress ab le mem brv (CAM) 
bv_Jnputting_a.^in ^e_ bit^binary data to the^ co nversion 

60 registeri-^nd converting the binary datq i nto two bits of 
te rnary da_ta using the conversio n-register: and (3) simulta- 
neously storing the two bits of ternary data in first and 
second memory cells. 

In one embodiment, for subsequent searching, the method 

65 is further comprised of the steps of searching for a match of 
input search binary data to the stored contents of the CAM; 
providing a match valid output responsive to the input 
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search binary bits matching any of the stored contents; and output B to memory cell 130, the memory (CAM memory 

..generati o g an aHHrn^K £iQrrft r-jTn Tir4 i' n g fn - n-Wf^|fpn in th e sut^ystcm) devicc 160 IS able to store the first and second 

CAM w he re the matc ^h fmi^H The step of searching for a ternary outputs (A and B) in the CAM cell 100, in parallel, 

matcn ITSirt her comprised of the steps of converting the within a single period of the clock's signal. This advanta- 

input search binary data into ternary search data, usin^j he 5 geously provides for single cycle ternary data conversion, 

c onversion repis^|er; anf| cQnip^nng \\}e. cqntents of the CAM Comparison, and write cycles. The ternary CAM storage and 

tq the ternar y ^search data. retrieval system ("TCAM") is inherently superior to binary i 

Tieferring to FIG. 1, there is illustrated a memory cell for CAMs in masking operations, and useful in storage and/or 

a ternary CAM cell 100 in CAM memory subsystem 160, io conversion, and/or comparison. The TCAM system is usefull 

accordance with the present invention. An externally pro- 10 ^ multitude of applications, such as^ filtering , addi 

vided Opcode input is decoded by decoder 110 to provide resohUign, mapping. Virtual lANSTetc. ^ - 

control signals for Mask Write, CAM Write, and other Referring to FIG. 2, a fourteen transistor static ternary 

control signals tootherareas otlhe.chip, as_wiU be described CAM cell, corresponding to the CAM cell 100 of FIG. 1, is 

in greater detail hereafter. The CAM cell 100 is comprised illustrated in further detail in accordance with one embodi- 

of a fiist jacmory cell 120 and a second memory cell 130, 15 ment of the present invention. The first and second ternary 

whichareut ilized to store the ternary data for that CAM cell data outputs (A and B) from the binary-to-temary converter 

location. M emory cell 120 stores the 1 or 0 data values, 140, are coupled to the CAM cell 100. The memory cell 120 

while memory cell 130 stores a 1 for a don't care "x" value receives the A input and, responsive to the CAM Write 

selection, and a 0 to indicate that the memory cell contains signal, is output from the decode logic 110, enables transis- 

data that should be utilized. An external clock is provided to 20 210 to couple the A signal to the storage element portion 

the subsystem 160, where the clock signal 105 is utilized to of the memory cell 120, as appropriately timed and clocked 

synchronize operations within the CAM memory. responsive to the external Clock, as discussed in further 

Additionally, external Mask Select code signals and Data detail with reference to FIG. 3 for the timing diagrams for 

Input signais arc provided to the CAM memory 160 and are the write cycle. Transistors 220-223 form the static memory 

coupled to a binary-to-temary converter 140 that converts 25 ^^^^ storage element 120. The output RA from the memory 

the binary Data Input into appropriate ternary data signal cell 120, representing an inversion of the originally input 

outputs A and B, which respectively couple to memory cells signal A, is coupled to one input of the comparator 150, as 

120,130. The Mask Write, CAM Write, and CAM Search is described in greater detail hereinafter, 

outputs firom the decode logic 110 are coupled to the Similarly, the second ternary data output B is coupled to 

binary-to-temary converter 140. These outputs from the 30 the memory cell 130 of the CAM cell 100 and is coupled via 

decode logic 110 selectively control (1) the writing of the transistor 230, responsive to the CAM Write signal output 

Data Input into the mask registers and into the binary-to- from the decode logic, via transistor 230, to the memory 

ternary converter, responsive to the Mask Select code; (2) storage cell 130 comprised of transistors 230-233, forming 

the conversion of the Data Input by the binary-to-temary the storage element portion of the memory cell 130. The 

converter into ternary code; (3) the writing of the ternary 35 storage cell output from memory cell 130, output RB, 

code into the memory cells 120 and 130 responsive to the provides an output corresponding to the inverted value of B 

CAM Write decode output; and (4) the activation of a CAM as stored initially to the memory cell 130. The output RB is 

search for comparing the ternary code for the then present coupled to a separate input of the comparator 150. 

current Data Input to the CAM cell contents by a comparator In a preferred embodiment, the comparator 150 is com- 

150, which is clocked at the ^ropriate time to do the 40 prised of a four transistor exclusive-OR comparator. The 

comparison. comparator 150 is comprised of two subportions, subportion 

The Mask Select code, Data Input, Qock, and Opcode one comprised of transistors 251 and 252, and a second 

inputs are provided to the CAM memory 160 from an subportion comprised of transistors 253 and 254. On a CAM 

external device or subsystem, such as a processor, which search cycle, the then current search Data Input is converted 

controls and establishes the loading of the mask registers in 45 by the binary-to-temary converter to provide the first and 

the binary-to-temary converter 140, the loading of the CAM second temary data outputs A and B, which are coupled to 

data contents into the memory cells 120,D0 and the setup the comparator 150 transistor gates for transistors 251 and 

and request for comparison. The comparator 150 provides a 254, respectively. Correspondingly, the CAM memory cell 

Match output indicating that the Data Input either matches outputs RA and RB are coupled to the gates of transistors 

or doesn't match the contents of the CAM memory. The 50 252 and 253, respectively. Thus, the ternary data for a search 

binary-to-temary conversion subsystem 140 provides the is coupled to the gates of one transistor of each portion of the 

first and second temary data outputs (A and B) responsive to comparator and the corresponding stored temary data output 

the Mask Select signals, the Data Input signals, and the firom the CAM cell 100 is coupled to the gates of the other 

Opcode signals. The memory cell 120 stores and selectively transistor of the corresponding portions of the comparator, 

outputs the first ternary data output A, responsive to the 55 Each portion forms a transistor series tree, having one side 

binary-to-ternary conversion subsystem 140 and to the coupled to ground (e.g., the source of transistors 251 and 

Opcode signals. The memory cell 130 stores the second 254) and having the other side of the tree coupled to a Match 

ternary data output B, responsive to the binary-to-temary line output 170. The transistors on each side of the series tree 

conversion subsystem 140 and the Opcode signals. The are series connected, that is the other side (the drain) of 

comparator 150 performs an exclusive-OR comparison in 60 transistor 251 is coupled to the source of transistor 252, and 

the preferred embodiment, and provides a match output the drain of transistor 252 is coupled to the Match hne 170 

responsive to comparing the first and second ternary outputs and to the drain of transistor 253, which has its source 

provided by the binary-to-ternary converter 140 for the then coupled to the drain of transistor 254, which has its source 

current Data Inputs, to the outputs of the first and second coupled to ground. The Match line is pre-charged during the 

memory cells. Since the data storage to memory cells 65 first portion of each clock cycle to a high level, and 

120,130 is performed in parallel, that is, the binary-to- thereafter is pulled to ground, or low level, if no match is 

ternary converter provides output A to memory cell 120, and found, thus indicating no match. If there is a match, none of 



05/25/2003, EAST Version: 1.03.0002 



5,8^ 

5 

the transistors in the comparator 150 are enabled and the 
match line stays at a high level. 

It is important to note that the ternary converter outputs, 
A and B signals, are not bit and bit bar, but are rather the 
encoded two-bit state to be written into the two memory 
element CAM cells 120,130. Thus, the standard six transis- 
tor static RAM cell, which writes bit and bit bar into both 
sides of the memory element, is not utilized, but rather a five 
transistor memory cell is utilized that provides a single 
ended input structure, which is driven by the A line for 
memory cell 120 or the B line for memory cell 130. 

The comparison circuit 150, as illustrated, is comprised of 
four N-channel transistors. On one side (one portion) of the 
comparator, the gates of each of the two transistors are tied 
to the Aline and RA lines (transistors 251 and 252). On the 
other side (the other portion) of the comparator, the gates are 
tied to the B line and the RB line (transistors 254 and 253, 
respectively). On one side of the comparator, a comparison 
is made, between the incoming decoded A signal and the 
memory cell output RA signal (by transistors 251 and 252) 
and on the other side of the comparator, a comparison is 
made between the B signal and the RB signal (transistors 
254 and 253, rei^ectively). The binary -to-ternary converter 
140 performs a code conversion as provided in T^ble 1 for 
a writing table (where the Write line equals one), and as 
provided in Table 2 for a matching table, where the Write 
line equals 0, For either a write or search operation, the 
binary Data Input is encoded by the binary-to-teraary con- 
verter 140 to corresponding A and B outputs. The Tables 1 
and 2 show four ternary codes for conversion. In a preferred 
embodiment, the null state "N" is not used for writing or 
searching, and is used for pre-charge and test functions only. 

Table 1 illustrates the writing table where the write line 
equals one, providing the logic for the binary to the ternary 
converter 140 performing code conversion; 



TABLE 1 





(B-> 


T Conversion "Dibie) 






Ternary 


A 


B 


RA 


RB 


-N" 


0 


0 


1 


1 




0 


1 


1 


0 


-0" 


1 


0 


0 


1 


-X" 


1 


1 


0 


0 



As shown, Table 1 depicts the ternary symbol (N,1,0,X) 
and the corresponding ternary data outputs A and B, and the 
corresponding memory cell outputs RA and RB. 

Table 2 provides the code conversion for the matching 
operation for the binary-to-teraary converter 140 performing 
code conversion; 

TABLE 2 



Matching Uble fWrite q 0^ 



Ternary 


A 


B 


RA 


RB 


MA 


"X" 


0 


0 


X 


X 


1 


"1" 


0 


1 


X 


0 


1 










1 


0 


"0" 


1 


0 


0 


X 


1 








1 




0 


"N" 


1 


1 


0 


0 


1 








ELSE 




0 



As shown. Table 2 illustrates the matching table, showing 
the ternary symbol (N,0,1,X) and the corresponding ternary 
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data outputs A and B, plus showing the Match output 170 
resulting from a comparison of the ternary code for the input 
data to the stored memory cell output data (RA and RB). 
By way of example, and referring to Table 2, in conjunc- 

5 tion with reference to the comparator 150 of FIG. 2, the 
search data A is compared to the stored cell output RA, and 
if they are exclxisive of each other, that is, if only one of them 
is logic high, this will inhibit that side of the comparator 
(transistors 251 and 252) from pulling the match hue low. 

jQ Similarly, if RB and B (the search data B) are compared, and 
only one of them is high, indicating that there is a match, 
then that side of the transistor series tree will not be able to 
pull the match line low. When neither one of the sides of the 
transistor series trees 251 and 252, or 254 and 253, are able 

j5 to pull the match line low, this indicates that there is a match 
in the cell. Conversely, if there is not a match in the cell, for 
example, if both A and RA are high, this would turn on both 
transistors 251 and 252, which would connect the output 
point coupling 252 and 253 to ground, thus pulling the 

2Q match line low, thus indicating there is not a match. Where 
there are a number of memory cells corresponding to a 
single memory word, the match line for each memory cell is 
coupled to other match lines for that single word in a 
wired-OR structure, such that a number of memory elements 

25 can be cascaded together to form a word, and if any of the 
memory cells forming the multiple bit word don't match, it 
will pull the match line low, indicating no match. 

Referring to FIG. 3, the timing waveforms for write 
cycles to the CAM memory is illustrated for a write zero 

30 cycle and a write one cycle. First, it is noted that the coding 
of the A and B lines, as indicated in Tables 1 and 2, are such 
that only one of the A or B lines will transition at a time 
during write and search operation. 
As illustrated in HG. 3, there is a Clock (i.e., the Clock 

35 signal 105 of FIG. 1) which periodically cycles at a system 
clock rate. During the first portion of the clock cycle, a 
pre-charge (or setup) cycle occurs (i.e., a pre-charge signal 
goes high) to pre-charge the match line, and both the A and 
B lines are forced to an active low state, putting 0 on the A 

40 and B lines during pre-charge. This guarantees that inde- 
pendent of whatever data is stored in the memory cells 120 
and 130, that the match line can be pre-charged to a logic 
one without having any DC path in the cxclusivc-OR 
circuitry of the comparator 150. After the pre-charge cycle 

45 is over (i.e., the pre-charge signal goes low), the A and B 
lines are allowed to change. The match line must be pre- 
charged, since if the match line is pulled low, it will remain 
low, since there is no active device to pull it up. Thus, the 
match line has to be reestablished at a high level each cycle, 

50 to establish a base level of one for a match each lime. After 
the pre-charge signal goes low, a search operation can begin. 
This will be discussed in further detail with reference to FIG. 
4. The pre-diaige is not directly used in the write cycle, and 
is used directly for the search cycle, as illustrated in FIG. 4. 

55 However, the pre-charge period is relevant in that the A and 
B lines are forced low during the pre-charge cycle time. 

As illustrated in FIG. 3, after the pre-charge cycle (the 
pre-charge clock signal goes from 1 to 0), a write operation 
may be performed. As illustrated in FIG. 3, a write "0" cycle 

60 occurs after the first pre-charge cycle during the first Clock 
cycle. The A and B signals are set up by the binary-to-ternary 
converter 140 and coupled to the memory cells 120 and 130 
as illustrated in FIG. 1. Thus, as set forth in Table 1, to write 
a 0, the ternary code outputs A=l and B=0, are provided, as 

65 illustrated in E^G. 3. The RA and RB signal lines are initially 
at a 0 level, and upon the Write CAM signal (115 of FIG. 2) 
being clocked high, the 0 write data is clocked into the 
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memory cells 120 and 130 and correspondingly, output 
RA«0 and output RB =1, as illustrated in FIGS. 2 and 3, and 
Table 1. Referring to the Clock signal of FIG. 3, the Clock 
then transitions from 1 to 0, again initiating the pre-charge 
pulse to initiate the pre-charge cycle, forcing the A and B 
lines to a 0 level, and thereafter, when the pre-charge pulse 
goes back to a 0 level, the A and B ternary code outputs are 
set up for writing a 1. As shown in FIG. 3, consistent with 
the writing table of Table 1, the ternary code for a "1" is A=0 
and Bs*!. Upon the Write clock 115 of FIG. 2 clocking active 
high, the ternary code is written into the memory cells 120 
and 130, respectively, storing inputs A and B, and the 
memory cell outputs provide for an output of RA-1 and 
RB-0, consistent with Table 1, illustrating the operation of 
the CAM cell 100 of FIG. 2. The writing of a "don*t care" 
would be similar to that illustrated for the writing of a 0 and 
1, except that in this case, both the A and B signals would 
be at a high level (i.e., A=B=1), which would be clocked 
upon the Write clock signal going high into the memory cells 
120 and 130, correspondingly thereafter providing outputs 
RA=RB=0. Similarly, writing an "N" into the memory cell 
would write a OA) (A=B=0), providing a corresponding 1/1 
output from RA/RB (i.e., RA-RB-1), leaving it to the 
incoming A and B to determine if there's a match, since 
transistors 252 and 253 of comparator 150 would already be 
enabled (by RA=RB=1) for a comparison. 

Referring to FIG. 4, a search cycle is illustrated in 
accordance with a preferred embodiment of the present 
invention. This assumes that the memory cells 120 and 130 
have already been written to during a previoiis write opera- 
tion. FIG. 4 illustrates only the search operation. The exter- 
nal input of Data Inputs and Mask Select inputs, in con- 
junction with the Opcode input, cause the binary-to-ternary 
converter 140 to provide ternary code data A and B outputs, 
which arc ultimately fed to the comparator 150, appropri- 
ately timed as illustrated in FIG. 4. Analogous to FIG. 3, 
FIG. 4 illustrates the periodic Clock signal, Pre-charge clock 
signal, the ternary code outputs A and B, and the memory 
cell outputs RA and RB, and also illustrates the Match line 
output 170, corresponding to FIGS. 1 and 2 and Table 2. The 
timing diagram of FIG. 4 illustrates first a search for a 1 and 
then a search for a 0. During the first portion of the Clock 
cycle, illustrated as the first quarter of the Clock cycle, the 
A and B lines are forced low, and the pre-charge line is 
charged to a high level (which high level indicates a Match 
if it still exists after a search is done). At the end of the 
pre-charge clock, the A and B signal lines are set up with the 
appropriate data, and since this is a search for 1, A-0 and 
B=l, in accordance with Table 2. This is compared to the 
stored memory cell outputs RA and RB by the comparator 
150 as illustrated and described with reference to FIG. 2, and 
at the end of the first clock cycle, the Match line status is 
clocked into a bufifer, and the buffer output is then decoded 
to determine the address where the match occurred (where 
there are multiple memory words in the CAM). 

When the Clock goes low to start the next Clock cycle, the 
pre-charge signal goes high, forcing the A and B ternary 
code outputs low, to permit the Match line to pre-charge 
back to 1 if needed. Then, when the pre-charge clock signal 
goes low, a search for 0 is set up, with A-1 and B-0, as set 
forth in Table 2. Since the stored memory cell output RA-1 
and RB«=0, then both A«l and RA=1, thus causing transis- 
tors 251 and 252 of FIG. 2 to couple the Match line to 
ground, thus forcing the Match signal low as illustrated in 
FIG. 4, indicating no match has occurred. As illustrated in 
FIG. 4, a buffer clock signal occurs at the end of each of the 
Clock cycles, clocking the match status into a Match Buffer. 
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The Buffer stores the match state for each Match line word 
location in the CAM memory. 

Also, note that if RA and RB are low, regardless of what 
A and B are on subsequent search cycles, there will always 

5 be a Match output. Additionally, if the external system Data 
Input is converted into a ternary code of 0/0 for A and B (i.e., 
A=B=0) for a search, indicating a don't care, then the 
comparator will provide a Match output no matter what is 
stored in the memory cells. This permits a global don't care 
search. Thus, two don't care search options are provided. 
Initially, the system can process the data coming in and store 
don't cares into the memory. Subsequent to that, the system 
can also mask out certain bits of information in the CAM, 
using a don't care ternary code input for the search data. 
Thus, a "don't care" search data input ignores what's stored 
in the CAM cell, and causes the respective don't care bit 
locations to always match. It is to be understood that there 
are multiple bits in the word (64 bits in the preferred 
embodiment), and thus individual bits can be masked for 
don't cares during search in real time, or can be pre-stored 

20 as don't cares when loading and writing the CAM memory 
cells, so that a match is found irrespective of the search data 
for that bit. 

In a preferred embodiment, as illustrated in FIG. 5, the 
CAM memory 300 is comprised of a plurality of CAM cells 

25 100 cascaded both in rows as words (of 64 bits each), and 
as multiple rows comprising multiple (64 bit) words. Each 
word is comprised of 64 CAM cells 100. Thus, where 64 
memory subsystems are coupled together to create a 64 bit 
word, all 64 of these match lines arc coupled together in a 

3Q wired-OR structure, so that any element on the match line 
has the capability of pulUog the match line low. Thus, all 64 
bits would have to match in order for the final Match output 
for that word to be valid (high). 

Since the comparator 150 is an exclusive-OR type device, 
the inverted outputs RA and RB are compared to the 
non-inverted ternary code outputs A and B so that inverted 
stored ternary code data is compared by the comparator to 
the non-inverted ternary code search data. The effect of the 
exclusive-OR is to utilize the inverted to non-inverted 
comparison to affect a comparison that utilizes an inverted 

^ and non-inverted input to properly perform the comparison. 
Other comparator structures can alternatively be utilized, 
which use cither inverted outputs or non-inverted outputs 
from the memory cells 120 and 130, as inputs to the 
comparator, and the choice is one of design alternatives. 

45 Referring again to FIG. 5, a memory array comprised of 
multiple memory storage sub-systems 100 is illustrated, 
wherein there are a plurality of memory subsystems 100 
coupled in a row forming a word, illustrated as 4 bits wide 
(64 bits wide in the preferred embodiment) having a com- 

50 mon wired-OR match output (match 0) which is coupled to 
a multi-Match Buffer register. There are a plurality of rows 
of memory subsystems 100, forming a plurality of words 
each n bits wide, each row coupling its match line in 
common to all memory subsystems 100 of that row 

55 (memory word) providing corresponding match outputs 
(Match (1) to . . . Matdi (N)), which Match outputs are 
coupled to separate flq)-fiops (or other storage) of the 
multi-Match Buffer register 310. The array of memory 
subsystems 100 form the memory array 300, in the preferred 

60 embodiment being 64 bits wide by 2048 words deep. The 
multi-Match Buffer register 310 provides a valid Match 
output 370, which indicates a valid match has occurred, and 
one output for each CAM location, indicating that location's 
match status. Thus, the depth problem is solved, and both 

65 multiple bit words and multiple word arrays of ternary CAM 
storage are eflBciently provided without speed operational 
penalties. 
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The CAM memory system of FIG. 5 includes the memory 
array 300, the multi-match buffer register 310, and the 
address generator (encoder) 320, plus other logic as 
described elsevs^ere herein. The multi-Match Buffer register 
provides a plurality of outputs, corresponding to the match 
status for each row or word of the memory array 300, and 
the Buffer register 310 outputs are coupled to the address 
generator 320, which performs a decimal to binary encoder 
function to provide an address output 325, representative of 
the address where the first matdi occurred. In the preferred 
embodiment, the encoding is from 2048 registers to a 10 bit 
binary code. This then provides an address output firom the 
CAM memory system which indicates the address corre- 
sponding to the first valid match. 

Referring to FIG. 6, a functional block diagram of a CAM 
memory system is illustrated, wherein external system 
inputs are provided as Data Input, Opcode Input, System 
Input, and Cascade Input, and where the memory system 
output is comprised of a Data Output, Opcode Output, 
System Output, and Cascade Output. Self testing is possible 
through the use of the Opcode Input and System Inputs, and 
can be performed by a self-test sub -system 405, and can be 
optionally provided depending on design considerations. 
TTie data coming in through the Data Input is coupled 
through the input buffer 410 to mask registers 440, which 
provide for binary-to-ternary conversion, in accordance with 
Tables 1 and 2. A host processor system provides the Opcode 
and Data Inputs necessary to load the mask registers to 
determine when don*t cares are to occur. Thus, initially the 
host processor provides appropriate Mask Select inputs and 
CAM Data Inputs to load the mask registers. In a preferred 
embodiment, 8 mask registers are used, and a 3 bit Mask 
Select input can provide selection of one of the 8 mask 
registers. Upon completion of loading of the mask registers, 
the host system provides either a 1 or 0 (binary data), and a 
Mask Select code, and the binary-to-temary converter con- 
verts the binary input into ternary code data as described 
earlier with reference to Tables 1 and 2 and FIGS. 1 and 2. 
The mask register is the width of the CAM words, 64 bits in 
the preferred embodiment, and any one of those bits, or any 
combination of those bits within each mask register, can be 
set to a logic 1 to mask out any bit location within the word 
being written into the CAM as a don't care. If there is a 0 
in the mask register, then the data (i.e., 0 or 1) is appropri- 
ately written into memory cell 120 of FIG. 1. When selected 
during CAM Write, the CAM memory is written into so that 
memory cell 120 is written with a binary 1 or 0, and 
dependent on the mask register bit, a 1 or 0 is written into 
memory cell 130, indicating either a don't care where a 1 is 
written into memory ceU 130 for those mask register bits 
having a 1, or that valid data (i.e., a 1 or 0) exists in memory 
cell 120 where a 0 is written into memory cell 130. In one 
embodiment, RAM storage is provided as a part of the CAM 
memory system, illustrated as RAM 450, which provides 
storage corre^onding to each word of the CAM array. In the 
preferred embodiment, 2K words by 16 bits of RAM are 
provided corresponding to the 2k words of CAM. However, 
the width is determined by design constraints, in a preferred 
embodiment, 16 bits. 

The RAM is loaded by the host system (or other external 
device), using the Opcode Inputs, and the RAM Data Inputs 
of the Data Input, to provide for loading of the RAM with 
contents that are thereafter provided as outputs for matched 
CAM locations, responsive to the multi-Match Buffer 410 
and the RAM comparator 420, to provide RAM data output 
(for the matched Cam location) to the output buffer 460. The 
output buffer 460 provides the Cascade Output for a multiple 
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memory system chip cascaded configuration including the 
RAM next output, the address output indicative of a match 
being found, and other control signals to permit cascading, 
as well as feed-through of the Data Output corresponding to 

5 the Data Input via CAM data pipe 0-63, and RAM data pipe 
0-15. Additionally, the output buffer 460 provides feed- 
through of the Opcode input including Field Start pipe. 
Mask Select pipe, OpCAM pipe 0-3, and OpEnable pipe 
0-2. The System Output feeds through the Clock, and the 

10 Ready, and RAM Zero System Output, The outputs provide 
for multiple ones of the memory chip systems 500 for 
coupling to one another to form an array (e.g., having a 
depth greater than 2K), but providing for output of the 
address and data (where the RAM is present within the 

15 system 500) to provide the indication of the address and 
corresponding RAM data for the first match location. The 
cascade logic 470 provides for cascade logic and feed- 
through of appropriate signals from the Cascade Input, the 
RAM, the multi-Match Buffer, and the address encoder to 

20 provide for the Cascade Output. The address encoder 455 is 
coupled through the Match Sorter 453 to the multi-Match 
Buffer 410, to provide a decode of the Match Buffer 410 and 
provide the corresponding address where the match was 
found. The Word Finder logic 405 provides necessary 

25 decode logic between the CAM memory array 300 and the 
multi-Match Buffer 410 to provide appropriate logic for 
interfacing to the multi-Match Buffer 410 and the RAM 
comparator 420. 

Referring to FIG. 7, the main components of the match 

30 sorter and address encoder of FIG. 6 are provided in greater 
detail. The 2 kilobits of output from the multi-match buffer 
410 are coi^iled as inputs to a wired-OR RAM 510, which 
provides a 16 bit RAM output to the cascade logic 470 as 
shown in FIG. 6. The wired-OR RAM 510 corresponds to 

35 the RAM array 450 and RAM comparator logic 420 of FIG. 
6. The output of the wired-OR RAM is also coupled to an 
address disabler 520, which selectively disables the address 
output responsive to the output from the RAM comparator 
420. The address disabler output selectively provides an 

40 address disabling output to the multi-match detector 530, 
which provides an indication of a multiple match indicator 
to the cascade logic 470, and detects if more than one of the 
2K lines are active. The selector 540 picks the first matched 
word and disables all others (the lowest address having the 

45 highest priority in the illustrated embodiment), and provides 
2 kilobits of output to the address encoder to be encoded into 
an 11 bit address output from the address encoder 550 (as 
discussed in detail with reference to FIG. 5 address genera- 
tor encoder 320 and analogous to the address encoder 455 of 

50 FIG. 6) indicative of the written, matched, or deleted work, 
depending on the type of operation being provided. 

Referring to FIG. 8, a state flow diagram is provided 
illustrating the operation of the system in resetting, loading 
mask registers, writing CAM words, and performing a CAM 

55 search. Initially, at state 610, the external system provides 
for a chip Reset that initializes everything (e.g., all memory 
cells and mask registers) to 0. Thereafter, the next state 620 
provides for loading of the mask registers. In the preferred 
embodiment as discussed above, there are 8 mask registers, 

60 however a single mask register can be used multiple times, 
and the choice is one of design alternatives. Responsive to 
the Opcode Inputs and Mask Select bits, and the Mask Write 
signal, the CAM Data Input is written into the mask registers 
at step 620, as indicated via the signal inputs coupling into 

65 step 620 from the left and right. To avoid applying any mask 
to the CAM Data being written in, one ma^ can be reserved 
and loaded with all O's, which provides for no masking 
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function. When writing to the CAM, at step 630, the Opcode 
combination provides appropriate signals, and the Mask 
Write is inactive, indicating a Mask Write is not occurring, 
and that the write cycle is a CAM data write. The CAM 
Write signal is active, and the CAM Data Inputs are coupled 5 
into and written via a selected mask register as determined 
by the Mask Select bits. This results in the binary-to-temary 
converter generating the appropriate A/B signals that go into 
the CAM memory elements for storage. Once the CAM 
memory array has been written to and the data stored (and 
where a RAM is located internal to the CAM memory 
system the RAM is also loaded as appropriate via RAM Data 
Inputs and appropriate Opcode signals), a CAM Search 
operation can be provided. The CAM search operation is 
responsive to a CAM Search Opcode, and the CAM Data 
Input, and a selected mask register as selected via Mask 
Select bits, which, as shown at step 640, provides for 
determining whether there is a match between the then 
current input CAM Data as compared to any stored value in 
the CAM memory array. If a match is found, the corre- 
sponding match lines are latched (in the multi-Match Buffer) 
and the output from the chip system is an address indicating 
where the match occurred. Additionally, where RAM is 
present on the chip, the RAM data for the corresponding 
CAM word location where a match was found is also output 
from the chip. 

Referring to FIG. 9, the memory system of the present 
invention is illustrated in an encryption embodiment for use 
in conjunction with an Asynchronous Transfer Mode (ATM) 
system. During an initial call setup, the^ATM-netwprk 800 
pro vides Jorjpommunijgttipn of informationj cgimlcid-yia bus 
SffiStointerface 710 to establish a call setup procedure prior 
to performing a write operation. The AX M mepiory system 
embodiment system 900 provides for storing of new ATM 
virtual address (i.e., as commonly denoted by its \^rtual P ipe 
Idenlififir (VPI) and Virtual Channel Identifier (VC D^ link 

data to be setu p and stored intn the CAM memnry array of 

me mory system 700 by performing the CAM- Write cycle 
process. I n accordance with the pre sent invention, a te rnary 
CAM system is provided that provideslFor ternary inform a- 
tio n being written into thclernary CAM ^lls in a single 
Clock cycle, which allows for the vm ^g of- a continu ous 
stream of ATM, messages, instead of having to stall drge lay 
the ATM system to facilitate a multiple cycle ternary CAM 
Write with risk of cell loss. In typical apphcations, an entire 
^^l^-JlLYP^^^^-^^"^^'''^"^^^*^"'^ af^Hrey ; information is 
s etup in the CAM memory cells, IheJookup tabl e, and the 
internal HAM if present, all in one con tinuous set^ f 
o perations rather than^just one J ocatio^. A real-tune com- 
munication networ k is thereby provided . 

After initial setup, communications from the ATM net 800 
via coupling 805 is made to an interface 710, which strips off 
the VPIA'^CI portion of the header from the pay load and 
remaining header potion of the ATM cell, and sends the 
VPIA^CI and remaining header, via coupling 815, to the 
processor 720, The processor 720 provides the appropriate 
Clock, Opcode, Mask Selects, CAM data, and other appro- 
priate input signals via coupling 721 to the ternary CAM 
memory system 700. The ternary CAM memory system 700 
can be comprised of one or a plurality of cascaded CAM 
memory systems of the type disctissed elsewhere herein. 
After setup is complete, the CAM search (and lookup table) 
can be utilized. 

The CAM Data from the processor, which is requesting a 
compare, is the stripped-off VPI/VCI portion of the header, 
which is compared to the contents of the CAM memory 700, 
which in turn provides an address output 701 when a match 
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occurs. The address output 701 is coupled back to the 
processor 720 and to a lookup table 730. During setup, the 
processor 720 loads the lookup table 730 with data, via 
coupling 723, corresponding to the Address output of the 
CAM 700. The lookup table 730 outputs specific encryption 
parameters 735 re^onsive to the address output of the CAM 
memory system 700. The lookup table 73M) provides the 
encryption parameters 735, which can be a unique key or 
some mechanism that sets up an cncryptor 740. The cncryp- 

) tion parameters 735 are coupled to the encryptor 740, which 
is also coupled to receive the payload data portion of the cell 
825, as provided by the interface 710. The encryptor 740 
then, encrypts the payload data in accord ance w igi the 
specific encr yption parameter keys"^ provided by t he 

; ^ofj2111* ^blfiJ^30,^ which are uniquely associated with the 
specific Wlj/Vc Taddress that was input as CAM Data into 
the CAM system 700. The encrypted data output 745 from 
the encryptor is coupled to a combiner 750, which recom- 
bines the encrypted data of the payload with the header, 

) including the VPWCI address, and provides a combined 
new cell comprising the header and encrypted data as output 
at 755 for coupling back to the ATM network 800 for 
communication therefrom to the appropriate destination. 
Tt^fiJftftk^ipUah lc 730^ while illustrated external to th e 

5 CAM-memory system 7UU, can alternatively be provided" as 
ajjart of the CAM^meni or y ^sy stem 700. H owever, to provide 
5ii£5cient encr yption para meters', it is desirable to have more 
than a 16-bit wide amount of RAM. Thus, to maintain cost 
effectiveness of the CAM memory chips of the memory 

0 system 700, the lookup tab le can be. prov i ded ex teroaUvand 
addressed_ iesponsi3 &g^tq the address output from the C AM 
memory j^y st^n^ 700, to a Al^flejabilit^Jo^ . 
The RAM within the CAM chip itself, where present, can be 
used to provide sync pulses, cnd-of-frame indicators, and 

5 many other simpler functions than the encryption 
parameters, and can be provided in addition to the lookup 
table 730. Thus, the presence of the RAM within the CAM 
memory system 700 is optional, and if present, can be 
supplemented by an external separate lookup table. Since 

0 not every CAM address needs to have a lookup table 
encryption, an external lookup table can be used with a 
much denser lookup function than an on-chip RAM. In one 
embodiment, the RAM is on-chip within the CAM memory 
system 700, and the lookup table is integrated internally, 

5 eliminating the need for the external lookup table 730. 
The lookup table is loaded as appropriate, corresponding 
to the CAM cell loading, via the processor 720, monitoring 
when a write operation is performed into the CAM memory 
700, and then providing a CAM address output 701 from the 

0 CAM 700, which indicates the memory location that is 
actually written to. Subsequent to that, the processor 720 
ta kes the appropriate action to load in the lookup table a n 
ap^ ^hate mappmg otlhe encryption parame ters as nec- 
es ^ to support that V^I/VCI address. Hveij WttETfe the 

5 lo okup tablej sin JlAM internal to the CAM mem orysystem 
700, the processor still monitors andjcewrites i nto the RAM 
appropriately to load the encryption parameter data needed. 
TTie processor 720 provides the Mask Select, Data Input, the 
Opcode Data input, the Qock, and other necessary param- 

0 eters for use by the CAM memory system 700. The proces- 
sor 720 processes thcVP I/VCI and.remainder of the Jieadcr, 
andjdgtcrmige sjhe ne^ ag propjjiate^step. In the preferred 
embodiment, the, VPI and VCl portion and the remainder of 
the header are typically not encrypted or transformed by the 

iS encryption system as illustrated in FIG. 9, and are recom- 
bined with the encrypted data by the combiner 750. 
Alternatively, the VPI/VCI could be re-mapped via the 
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processor and VPIA^CI mapping contained either within the plurality of data input signals based upon the mask 

CAM system 700 as RAM or utilizing another external selectsignals, the plurality ofdata input signals, and the 

memory system, to provide a new VPI/VQ address to be opcode signals; and 

recombined with the remaining original header and the 3^ i^^^ memory subsystem, comprising: 

encrypted data. 5 ^ first memory cell for storing the first ternary data 

The encryptor 740 provides a method of scrambling the output for the first data input signal in response to the 

input data based on certain encryption parameters, which binary-to-temary conversion subsystem and the 

can be any sort of scrambling and encryption, such as keys ^ signals, said first memory cell having an 

for a specific user path. The encryption parameters in the outout* 

lookup t^le are thus loaded in accordance with some ^ ^ ^^^^ 

predefined encryption algonthms to provide the necessary / ^ . ^ 

parameters for the encryptors 740. The keys arc loaded as , T. T. • \ T 1/ o ^ L^y^Ju^ 

appropriate, so that each respective VPWCI address has ^? bmary-to- temaiy conversion subsystem and 

associated with it its own key, or no key, so that the the opcode signals, said second memory cell having 

corresponding destination address system can decode the ^° output; and 

encrypted data on the other end with that unique key. The ^ comparator for companng, in response to the opcode 

lookup table must provide the appropriate equivalent key, so signals, the first and second ternary data outputs for 

the encryptor encodes the payload data in accordance with the second data input signal to the outputs of the first 

the key that is going to be used on the other side when the and second memory cells, said comparator output- 

payload data is decoded. ting a first match output 

During the initial call setup from the ATM network, 20 2. The memory apparatus as in claim 1, further compris- 

messages arc passed back and forth to define what keys (e.g., ing: 

encryption parameters to be stored in the lookup table) can means for supplying a periodic clock signal; 

be used, what algorithms, which VPI/VCI locations have wherein the memory apparatus stores the first and second 

access, and various other parameters that can be defined for ^^^^^ o^jp^t^ fo^ gj^j data signal within one 

the encryption process. An agreed-to miUal key can be used 25 pe^od of the periodic clock signal. 

to encrypt the initial data that is sent with a common pubhc 3 apparatus as in claim 1, wherein the 

key that aU users have, and thereaftei, pnvate keys are binary-to-temary conversion subsystem is fiirther comprised 

utilized for encryption and decodmg. The pnvate key ^ ^f. 

unique for a VPWCI pair, although multiple VPWCI pairs * , . , ^ , - , . ^ 1 

can have the same key. Tbc processor 720. responsive to the 30 ' subsystem for selectively stormg the cbta 

loading of the CAM, provides for loading the lookup table mput signaU dunng a setup cycle and for thereafter 

with the corresponding keys for certain addresses in encodmg the data mput signals mto the first and second 

response to communications from the ATM network 800 of , l^^^^V '^^^ ""'P^** °" select signals^ 

key values for certain VPl/Va addresses. Tlie interface 710, ^: ^ '°«'°°fy apparatus as w claim 3, wherein the mask 

the ternary CAM memory system 700. and the processor 720 35 "^^^^ subsystem is comprised of a plurality of mask 

provide tianslaUon of the VPWCI addresses to addresses "^f^^ addressable using the mask selert signals, 

for encryption keys for the respective VPI/VCI addresses, , "'«°'°'y app«atus as m claim 1. wherein said at 

responsive to the ternary CAM 700 output 701. TTie output l^f^. "'"'T'y subsystem mdudes a plurahty of memory 

701 provides the address to the lookup uble 730 which suteystems that^ arranged m rows to form a plurality of 

provides the encryption parameters 735 as necessary to 40 ^ ip e Di wo , 

encrypt the payload data 825 by the encryptor 740. The wherem the first match outputs for the memory sub- 

encryption payload data is combined by the combiner 750 systems forming each multiple bit word are connected 

with the header for output 755 to the ATM network 800. together m a wircd-OR connection, the logical-OR 

The ATM system benefits by utilizing oflf-loaded key connections for the plurality of multiple bit words 

encryption of payloads, independent of address routing 45 outputtmg a second match output, 

information (e.g., VPI/VQ), which is first stripped, and. , inemory apparattis as m clami 5, wherem each of 

after encryption, re-appended ftomAo the payload. This the plurality of mulUple bit words has an associated address 

encryptionofpayloadscanbeperformedtransparenUytothe 1^^^^^^°* ^PP^^^ composing: 

ATMs* other network operations. This benefit can also be ^i^ans for outputtmg an address location corresponding to 

utilized by other communications schemes, where a portion 50 second match output. 

is stripped oft, is encrypted, and then recombined for trans- . Th^ memory apparatus as in claim 6, further compns- 

mission. On the receiving side, the same associative lookup/ ^S* 

mapping is used to determine the encryption keys, and the means for oulputting a data output associated with the 

encrypted payload is then de-cncrypted using the encryption address location corresponding to the second match 

keys. 55 output. 

What is claimed is: ^- ^ method of storing ternary data, comprising the steps 

1. A memory apparatus for coupling to an external device, of • 

the external device outputting (a) a plurality of data input initializing a binary-to-teraary conversion register by stor- 

signals to be processed in the memory apparatus, the plu- ing binary-to-temary mask data in the binary-to -ternary 

rality of data input signals comprising at least a first data 60 conversion register, wherein the binary-to-temary mask 

input signal and a second data input signal, (b) mask select data comprises information relating to the location of 

signals for specifying a data mask indicating desired loca- don't care bits for input data; 

tions of don*t care bits for the data input signals, and (c) inputting a single bit of binary data to the binary-to- 
opcode signals for specifying operations to be performed by ternary conversion register, 

the memory apparatus, the memory apparatus comprising: 65 converting the single bit of binary data into first and 

a binary-to-teraary conversion subsystem that generates second ternary data outputs using the binary-to-ternary 

first and second ternary data outputs for each of the conversion register; and 
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simultaneously storing as stored content addressable 
memory (CAM) data the first and second ternary data 
outputs in first and second memory cells forming a 
content addressable memory. 

9. The method as in claim 8, further comprising the steps 

f: 

providing input search binary data; 

searching for a match of input search binary data to the 

stored CAM data; and 
generating a match valid output when the input search 

binary data matches any of the stored CAM data. 

10. The method as in claim 9, further comprising the step 

of: 

generating an address corresponding to a location in the 
CAM where the match is found. 

11. The method as in claim 9, wherein the step of 
searching for a match is further comprised of the steps of: 

converting the input search binary data into ternary search 
data, using the binary-to-temary conversion register; 
and 

comparing the stored CAM data to the ternary search data. 

12. A memory system comprising: 

a binary-to-ternary conversion register; and 
means for initializing the binary-to-temary conversion 
register, comprised of: 

means for storing binary-to-temary mask data in the 
binary-to-temary conversion register, said binary-to- 
temary mask data comprising information relating to 
the location of don't care bits for input data; 

a content addressable memory (CAM) comprising at 
least first and second memory cells for storing ter- 
nary data; 

means for storing the temary data in the CAM, com- 
prising: 

means for inputting a single bit of binary data to the 

conversion register; 
means for converting the binary data into two bits of 

ternary data using the conversion register; and 
means for simultaneously storing as stored CAM 

data the two bits of temary data in the 
first and second memory cells of the CAM. 

13. The memory system as in claim 12, further compris- 
ing: 

means for supplying input search binary data; 

means for searching for a match of input search binary 

data to the stored CAM data; and 
means for generating a valid match output when the input 

search binary data matches any of the stored CAM data. 

14. The system as in claim 13, further comprising: 
address generator means for generating an address corre- 
sponding to a location in the CAM where the match is 
found. 

15. The system as in claim 13, wherein the means for 
searching for a match is further comprised of: 

means for converting the input search binary input into 
temary search data, using the conversion register; and 

means for comparing the contents of the CAM to the 
ternary search data. 

16. A memory system for providing secure asynchronous 
transfer mode (ATM) communications for an ATM network 
that transmits a plurality of data cells, each of the plurality 
of data cells comprising paybad data and header data 
comprised of virtual path identifier (VP!) address data and 
virtual channel identifier (VCI) address data, the memory 
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system receiving a plurality of data input signals, encryption 
VPl and VCI addresses, and associate key data signals, the 
memory system comprising: 

a ternary content addressable memory (TCAM) sub- 
^ system for storing a first one of the data input signals 
as stored ternary data, and for determining a match 
output address by comparing another one of the data 
input signals to the stored temary data; 
10 an addressable lookup table subsystem for storing the key 
data signals and seleaively outputting the key data 
signals in accordance with the match output address; 
wherein the TCAM subsystem and the addressable lookup 

table subsystem form a memory subsystem; 
means for initializing the memory subsystem comprising: 
means for storing the encryption VPl and VCI 

addresses in the TCAM subsystem; 
means for storing the key data associated with the 
encryption VPl and VCI addresses in the lookup 
table; 

means for separating the payload data from the header 

data for each of the data cells; 
means for delivering the separated header data to the 
TCAM, wherein the TCAM selectively outputs the 
match output address when the separated header data 
matches one of the stored VPl and VCI addresses; 
wherein the lookup table outputs the key data associ- 
ated with the respective match output address; 
means for encrypting the payload data in accordance 

with the key data; and 
means for combming the encrypted payload data with 
the separated header data to form an encrypted data 
cell 

17. The memory system as in claim 16, further comprising 
means for transmitting the encrypted data cell to the ATM 
network. 

18. A memory system for implementing a secure asyn- 
chronous transfer mode (ATM) communication system that 

^ transmits a plurality of signals comprising associated 
encryption key data, virtual path identifier (VPl) data, virtual 
channel identifier (VCI) data, and data cells, each data cell 
comprising payload data and header data, said header data 
comprised of VPI/VCl data, wherein respective ones of the 
encryption key data are associated with respective ones of 
the VPI/VCI data, the memory subsystem comprising: 
a content addressable memory (CAM) subsystem for 

storing data therein; 
an addressable lookup table; 
50 a processor for storing VPI/VCI data into the CAM 
subsystem, each at a re^ective storage address, and for 
storing the respective associated encryption key data 
into the lookup table at a location mapped to the 
respective storage address; 
a decoder for separating the header data from the payload 

data for each of the data ceUs; 
wherein the CAM subsystem includes means for compar- 
ing the stored data therein to the separated header data 
to selectively determine a match address output when 
the separated header matches any of the stored data 
therein; 

wherein the lookup table outputs the associated encryp- 
tion key data based on the match address output; 
65 an encryptor for encrypting the separated payload data 
based on the encryption key data output from the 
lookup table; and 
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a combiner for combining the encrypted payload data 
with the separated header to form an encrypted data 
cell. 

19. The memory system as in claim 18, further compris- 
ing: 

means for communicating the encrypted data cell 
through standard ATM infrastructure systems. 

20. The memory system as in claim 19, further compris- 
ing: 

means for receiving the communicated encrypted data 
cell; 

a decoder for separating the header data from the payload 
data for each of the communicated data cells; 

wherein the separated communicated header data is com- 
pared to the stored data in the CAM subsystem; 

wherein the CAM subsystem compares the stored data 
therein to the commimicated separated header data to 
selectively determine the match output when the com- 
municated separated header matches any of the stored 
data therein; 

a decrypter for de-encrypting the communicated 
encrypted separated payload data based on the encryp- 
tion loey data; and 

means for communicating the de-encrypted separated 
payload data. 

21. A memory apparatus, comprising: 

a plurality of memory locations each capable of storing at 
least one data record having multiple bit positions; 
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means for receiving data records from an exterior envi- 
ronment for storage in said pluraUty of memory loca- 
tions; 

means for storing said received data records in said 
plurality of memory storage locations; 

means for receiving a search data record from an exterior 
environment for comparison with data records stored in 
said plurality of memory locations; 

means for comparing said search data record to multiple 
data records stored in said plurality of memory 
locations, to determine whether a record match exists 
between said search data record and one of the multiple 
data records, said comparing means comparing each bit 
position of said search data record to a corresponding 
bit position in each of said multiple data records; 

means for storing first don't care bits in selected bit 
positions within said multiple data records stored in 
said plurality of memory locations, said first don*t care 
bits each resulting in a bit match when compared with 
data bits in said search data record irrespective of the 
content of said search data record; and 

means for adding second don*t care bits to selected bit 
positions in said search data record, said second don't 
care bits each resulting in a bit match when compared 
to data bits in said multiple data records irrespective of 
the content of said multiple data records. 
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ABSTRACT 



The invention relates to a CAM/RAM memory device with 
a scalable and flexible structure. The device has a number of 
rows of memory ceUs. At least on e address decoder is 
connected by word liaes to the cells of the rows. Verb al bit 
li nes"for match3ata Jm alement_CA l^iJiun^i6nalhvl)f the 
memory device. According to the invention a mask is 
implemented in a row of the memory cells, the mask 
affecting the match data on the bit lines. Preferably, the 
memory device is divided into segments with a mask at the 
top of each segment. By means of the present invention, 
masking is obtained by inserting masks as mask rows 
between the CAM rows. The mask rows are programmed by 
writing the mask row cells in the same way as the CAM 
cells. The mask rows operate directly on the bit lines for the 
whole underlying segment of rows. By means of this 
arrangement, the invention makes efQcient use of the avail- 
able silicon area. The memory device has a useful applica- 
tion as a device for handling address look-up, e.g. in a switch 
or router. 

15 Claims, 3 Drawing Sheets 
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MASK ARRANGEMENT FOR SCALABLE 
CAM/RAM STRUCTURES 

FIELD OF INVENTION 

The present invention relates to a CAM/RAM memory 
device with a scalable and flexible structure. CAM/RAM in 
this document means that the memory device functions both 
as a CAM (Content Addressable Memory) with matching 
operations and also has all ordinary RAM (Random Access 
Memory) functionality, i.e. a memory device with a CAM 
match operation mode and a RAM addressed read mode, 
where selectable parts of the memory cells can be masked 
ofif for the CAM mode and reading and writing may be 
performed directly through an address decoder in the RAM 
addressed read mode. 

By means of the present invention masking is obtained by 
inserting masks as mask rows between the CAM rows. The 
mask rows are programmed by writing the mask row cells in 
the same way as the CAM cells. The mask rows operate 
directly on the bit lines for the whole underlying segment of 
rows. CAM/RAM memory devices are especially useful, but 
not limited to, address look-up devices. The memory device 
may employ a blodc structure for an efficient use of the 
silicon area. Blocks are arranged in parallel with common 
word lines running through all the blocks and operative in 
both RAM read/write and CAM look-up. 

STATE OF THE ART 

CAM memories are attractive for use in address look-up 
devices, and there are many different implementations of 
doing so. A general structure contains a CAM memory 
where selectable parts can be masked off to implement 
different look-up schemes and get the ability to handle 
hierarchical addresses. To get an efficient handling of 
address learning and changes of the look-up table it is also 
desirable to have a CAM that in parts works as an ordinary 
RAM. For address flexibility it is also desirable to have the 
partitions into CAM and pure RAM configurable. There 
exists a number of different ways of achieving these memo- 
ries and examples include those described in e.g. U.S. Pat. 
No. 5^83,146 (Threewitt) and U.S. Pat. No. 5,706,224 
(Srinvasan et al.). 

When using a CAM/RAM structure as a part of an 
integrated device, the form factor and size are very impor- 
tant parameters. This is the case of a pure CAM chip as well 
but more so in the case of integration into single chip 
devices. This is important to consider \^en deciding how to 
implement the masks desired for masking out bits not to be 
taken into account in the match operations. The common 
way of implementing the masks are either to have separate 
mask registers as in the above U.S. patents, or to implement 
a mask into each memory row. Implementing a mask into 
each memory row (so called ternary CAM cells) is really not 
desirable for these kinds of area sensitive implementations, 
unless the application itself demands a mask for every row, 
since it increases the size of all CAM cells. The implemen- 
tation with mask registers on the other hand requires extra 
logic and addressing for writing to these registers, and it is 
also not suitable for implementing more than a few mask 
rows. So, the problem is to find a way of efficiently imple- 
menting a stmcture where a rather small subset of the 
memory array shares a mask, e.g. for every 15 CAM rows 
there is one mask row. 

The present invention solves the problem of enabUng 
flexible and area-efficient mask implementations by insert- 
ing the masks between CAM rows in the memory array and 
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masking the data in the bit fines. This gives a more flexible 
stmcture than the common way of placing the mask register 
and masking function before match data affects the bit lines 
as shown in FIG. 1. This also allows fine granularity and 

5 efficient integration into the memory array. Also, since no 
change is made to the CAM cells, these do not suffer from 
increased area. 

By implementing a number of the memory rows as mask 
rows only, instead of CAM rows, it is possible to address 

10 these in the same way, and with the same address decoder as 
the rest of the memory. It gives a flexible way of imple- 
menting different mask granularities for different CAM 
implementations when designing a system. 

IS SUMMARY OF THE INVENTION 

The present invention provides a CAM/RAM memory 
device with a scalable structure comprising a memory 
having a number of rows of memory cells. At least one 
address decoder is connected by word lines to the cells of the 
rows. Vertical bit lines for match data implement CAM 
functionality of the memory device. 

According to the invention a mask is implemented in a 
row of the memory ceUs, the mask affecting the match data 
25 on the bit lines. 

In a preferred embodiment, the memory device is divided 
into segments with a mask row at the top of each segment. 

The scope of the invention is defined in the accompanying 
claims. 

By means of this arrangement, the invention makes effi- 
cient use of the available silicon area. Also, the memory 
device is flexible since all the memory cells may be handled 
directly in RAM mode for reading and writing. Thus, the 
data structure is easily changed including the mask data of 
the CAM functionality. The memory device has a very 
useful application as a device for handHng address lookup, 
e.g. in a switch or router. 

BRIEF DESCRIPTION OF THE DRAWINGS 

40 

The invention will be described in detail below with 
reference to the accompanying drawings, of which: 

FIG. 1 is a diagram of the prior art CAM/RAM structure, 

FIG. 2 is a diagram of the CAM/RAM segmented struc- 
^5 ture according to the invention, 

FIG. 3 is a diagram of the CAM/RAM block structure 
according to the invention, 

FIG. 4 is a diagram of a detail of a CAM memory 
including CAM rows and a mask row according to a first 
embodiment, and 

FIG. 5 is a diagram of a detail of a CAM memory 
including CAM rows and a mask row according to a second 
embodiment. 

DETAILED DESCRIPTION OF PREFERRED 
EMBODIMENTS 

When using these CAM/RAM structures as parts of a 
custom integrated single chip device, it is important that the 

60 structure is flexible both in terms of configurability and 
physical form factor. The intention of the invention is to 
enable a more flexible partitioning of the complete structure 
for this kind of memory. 

The basic CAM/RAM structure according to the prior art 

65 is shown in FIG. 1. Input data is compared with the contents 
of the match field 5 and bits not to be considered in the 
comparison are defined in the mask 6. The data to be 
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retrieved is stored in the data field 7. FIG. 1 illustrates one FIG. 4 shows a detail of a block of a CAM memory and 

example of the prior art in which the mask is stored in a shows the masked part of the rows including two CAM rows 

separate register and applied to the match data before the 10 and a mask row 9. The top CAM row is the bottom row 

match data is applied to the CAM memory. As mentioned in of an overlying segment and the other CAM row is the first 

the introduction, an alternative (not shown) is to implement 5 CAM row of the next segment under the mask row 9. All 

a mask in each row of the CAM memory. jows contain memory cells 11 which may be addressed 

The additions needed for handling different types of through word lines 3 by the address decoder 2. The memory 
address look-ups are to have a dynamically changeable cells of the CAM rows 10 are each connected to a corn- 
configuration of the sizes of the CAM and RAM parts, parator 14. As is known in the art, match data is applied to 
respectively. In the cas^e of IP address look-upjbereare^ a j^, two bit lines for each cell. If the match data 
numberofdiffe^^ is not masked, one bit line of the pair is the inverse of the 
^ClUK MAC IP-mmFTnTTCT^e distnbution comparator 14 compares the match data with 
between these different types is not knov^^^^^ ^^^^^^ the^nemory cell U. If there is a hit, this is 
it depends on the land of device the chip is used in, and on , . i_ • .t. u ^ 
the environment in which this device is used. Tht size of the ^^8°^^^ °° ih^ m^ich Lne O rumimg through the row On 
necessary memory is on the other hand not that dependent on the other hand, if the match data is masked both bit hnes m 
which type of address types it has to handle, A certain size ^^"J' ^^^^^ resulting m a hit mdependent of the 
of switch needs a certain amount of address memory contents of the memory ceU 11 ("don't care"), 
depending on the amount of traffic it is able to handle, but The mask row 9 also contains memory cells 11, in this 
this traflEic can demand any of the address types, maybe all, case for storing a mask. Each memory cell of the mask row 
maybe just one, but the amount is fairly stable. Therefore it 20 is connected to a mask unit 15 which performs the mask 
is important that this partitioning of the memory can be function. If the match data received by the pair of bit lines 
made dynamically and in rather fine granularity. jg to be masked, the mask unit 15 changes the match data to 

The granularity chosen in this example is 16 word blocks a pair of zeros. If the mask is not to be applied in the cell, 

where the first row in every block is a mask row. The the match data is propagated unchanged. As is known to a 

resulting structure is shovra in FIG. 2. The priority encoding is person skilled in the art, there are various ways to implement 

for the structure of FIG. 2 is stricUy based on order and function of tiie mask unit 15. 

therefore adds to the demand of an efficient RAM handling r j • 1. • ^ 1 « / r i_- i_ 

for moving blocks of data when reconfiguring the look-up ° ^f"" ™- ™^ » 

table ^ ^ r ^j^y ^ shown), are connected m senes. Hence, it will be 

As is shown in RG. 2, the memory comprises a number 30 ^^"^^tf^ ' ""f ^ row affects aU underlying rows of 

of segments 8, each segment containing a mask row 6' at tiie ^« V^^J "^"^^ ^^^^ ^ 

top and underlying CAM rows. Each CAM row comprises arranged in a hierarchical order m accordance with the 

a match field 5 and a data fieklT and preferably also a part Pri^^ty function mentioned below This is not a great 

containing optionally match or data 17. Thus, it is possible disadvantage because of the hierarchical nature of IP 

to have variable length match field and the mask 6' is 35 addresses where it is desired to obtain hits with the longest 

adapted accordingly. P^^x match. 

For practical reasons, all segments 8 have the same size, As an alternative to a common address decoder for all 

e.g. 16 rows, having one mask row and 15 CAM rows. rows, it may be practical to use separate address decoders for 

However, the segments 8 could be customized to various CAM rows 10 and mask rows 9. Aseparate address decoder 

sizes. 40 2' mask rows 9 is outlined in FIG. 4. This increases 

As is known in the art, each row of the memory contains ^^^e a little but might in some cases simplify the use of 

cells wbidi hold a match part to be compared with a ma^ CAM memory. 

and a data part in which is the stored useful data. The In FIG. 5, an alternative embodiment of the mask row 

configuration of the sizes of the match and data parts is arrangement with another design of the bit lines is shown, 

dynamically changeable which is useful for handling e.g. 45 The same reference numerals are used for identical elements 

different types of address look-ups. It is especially useful if in FIGS. 4 and 5. In the alternative embodiment two levels 

some part of the row may be configurable to hold either ofbit lines are provided. Afirsl lower level comprises bit line 

match bits or data bits as the match&data field 17. pairs 12* connecting tiie mask row 9 (the top row) of each 

As is described in tiie introduction of the specification, tiie segment witii its underiying CAM rows 10. A second higher 

form factor and size is important leading to the block 50 level comprises a global bit line pair 12" running uninter- 

structure as is shown in HG. 3. The memory is partitioned ruP^d tiirough all segments but only connected to the input 

into blocks 1 of e.g. 512 words. In the figures only four to tiie top row of each segment, that is the mask row 9. These 

blocks 1 are shovra but generally the device inchides more bit Une pairs 12" connect tiie mask rows in parallel. The 

parallel blocks to offer a complete memory space of e.g. 8 function is identical with the arrangement of FIG. 4, except 

k words. An address decoder 2 is connected to all the 55 match data is applied to the overlying bit line pairs 

memory rows of tiie blocks by means of word lines 3 of 12", so tiiat tiie match data is applied in parallel to all the 

which only the top four arc indicated. The address decoder mask rows and propagated through tiie bit line pairs 12' 

2 cooperates with a multiplexer 4 to enable reading and tiirough all segments at the same lime. Thus, tiie simpler 

writing of tiie individual memory cells. The multiplexer 4 structure of tiie serial bit lines 12 of FIG. 4 is traded against 
selects output data from a sense amplifier (not shown) from 60 a faster search operation in tiie parallel stmcture of FIG. 5. 

one selected memory block 1. The block selection is con- Another advantage is that tiie mask rows do not have to be 

trolled by control logic 16 (mux CTRL) which is responsive arranged in a hierarchical order, since tiie mask rows do not 

to the kind of operation (RAM addressed read or CAM affect each otiier. 

match operation read), some address bits, and prioritising In a RAM addressed read mode, the address decoder 2, 2' 
between hits in different blocks 1. The control logic 16 65 and the multiplexer 4 cooperate to enable reading and 

receives commands from an operation line and match hit bus writing by means of the word lines and bit lines. In a CAM 

(not shown). mode, the masks 6* are applied to the bit lines tiius forming 
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match data lines running vertically through the memory 
blocks. There may be several hits in one and the same block 
1. As is known in the art, a priority means 18, shown in FIG. 
4 only, is connected to the matdi lines 13 for handling CAM 
matches and takes care of ensuring that only one match is 5 
generated in one block 1 of memory. In other words, the 
priority function selects one of the plurality of matches. 

The mask arrangement according to the present invention 
thus makes it possible to implement the masking function 
and mask addressing of the memory device resulting in an 
efficient use of the available silicon area in a flexible way. A 
person skilled in the art will appreciate that the embodiment 
of the invention described in detail here may be varied as to 
form and sizes of the various parts. Terms such as vertical 
and horizontal are used only in a figurative sense and in 
relation to each other The scope of the invention is only 
limited by the claims below. 

What is claimed is: 

1. A CAM/RAM memory device with a scalable stmcture, 
comprising: 20 

a memory having a number of rows of memory cells; 

at least one address decoder connected by word lines to 
the cells of the rows; and 

vertical bit lines for providing match data to implement 2S 
CAM functionality of the memory device, wherein said 
rows include at least one mask row for providing a 
mask affecting the match data on the bit lines. 

2. A memory device in accordance with claim 1, wherein 
the memory is divided into segments, each segment having 30 
a predetermined number of said rows and a mask row. 

3. A memory device in accordance with claim 2, wherein 
the segments have the same size. 

4. A memory device in accordance with claims 1, 2 or 3, 
wherein the address decoder is coimected to all of the rows 35 
in said memory in order to control reading and writing of the 
memory ceUs. 

5. A memory device in accordance with claims 1, 2, or 3, 
further including a separate address decoder connected to 
each said mask row for controlling reading and writing of ^ 
the mask. 

6. A memory device in accordance with claims 2 or 3, 
wherein said mask row is adapted to permit propagation of 
the match data on the bit lines or to permit said match data 

to be changed to "don't care" in accordance with the mask. 45 

7. A memory device in accordance with claim 2, wherein 
said mask row in each of said segments is connected in 
series by said bit lines. 



135 

6 

8. A memory device in accordance with claim 2 or 3, 
further including additional bit lines coupled only to said 
mask rows, and wherein in each segment the mask row and 
said predetermined nmnbcr of rows are coupled to said 
vertical bit lines. 

9. A memory device in accordance with claim 1, further 
including a mxiltiplexer, wherein the memory is divided into 
parallel blocks, each said block having a plurality of rows of 
memory cells and wherein each row in a given block shares 
a word line with its corresponding row in the other of said 
blodcs and said multiplexer is adapted to select said parallel 
blocks for reading data therefirom. 

10. A memory device in accordance with claim 9, wherein 
a plurality of CAM matches may occur in a given block, said 
memory device further including priority means for select- 
ing only one of said matches. 

11. A memory device in accordance with claim 2, wherein 
said memory cells include match parts and data parts that are 
configureablc so as to be dynamically changeable. 

12. A memory device, comprising: 

a plurality of segments each having a CAM/RAM struc- 
ture; 

said CAM/RAM structure including memory cells 
arranged in a predetermined plurality of rows, said 
memory cells in each row coimected by word lines and 
arranged to form a plurality of columns; 

at least one address decoder coupled to said word lines; 
and 

a plurality of data lines coupling said memory cells in said 
colimms and for applying match data to said CAM/ 
RAM structures, wherein one of said rows in each of 
said segments is adapted to store a mask for masking 
said match data. 

13. The memory device of claim 12, wherein each of said 
CAM/RAM structures includes a match field dynamically 
adaptable to a length of said mask. 

14. The memory device of claim 13, wherein said mask is 
operable for masking said match data for said rows in each 
of said segments. 

15. The memory device of claim 14, further including 
means for applying in parallel said match data to all of said 
mask rows. 

« 41 * « * 
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ABSTRACT 



An apparatus and method for letuiiing /and filtering 
des tination and source addresses in alocarareanet work 
(LAN) sys tem is provided to facilitate the transfe r of 
in fonnatiwi packets from o ne local area networlTto 
uiot^r. J[%e apparatus provules a linlTSetween two (or 
more) remotely l ocated LANS through the use of a 
LAN brid ge. Tlie app aratus has a LAN controller int er- 
fac e a micrpprocessbr, a state machine and one or more 
content addressable memories which fiinctj on-tO-moni- 
t or aninformation packet received from the LAN . One 
su ch apparatus is , connected to each of the LANs to be . 
int erfaced and each of tf»gseapparatus to_each other in 
o rder to comminucate mj SM^ betw e e n tf^ LiWs 
Upon det ection orSTiiSforroattQn packet, the appara tus 
examines both the source,and.dcstination^addresscs o f 
tl^s.Ba£ket. The so urce address is compared to a dyn am- 
i cally generated table of source add res ses and tfjeiTc om- 
p ared tQ,thS ncxnnihiL.d estina tiq^^ If the desti- 

na tion address fa fou ndjn , the tab lf^jof sq urc e add resses, 
t han the pac ket is destmed for a n ode on the local LAN 
and wjiT^I^ jen^ the 
tw^I^SNis. IfrMTtSroQie^^ ad- 
dress is,not ia^the table^the packet is fopftfa ided to the 
re mot j Cj^A N. 

15 Claims, 6 Drawing Sheets 
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802.5. They differ from one another both as to the phys- 
APPARATUS AND METHOD FOR LEARNING ical structure and as to the structure of their information 
AND FILTERING DESTINATION AND SOURCE packets. A thorough description of the respective sys- 
ADDRESSES IN A L OCAL AREA NETWORK tcms is contained in the publication Understanding Local 
SYSTEM 5 Area Networks referred to above. In panicular, the 

structure of the packets presents different problems 
A portion of the disclosure of this patent document when attempting to interface two LANs utilizing either 
contains material which is subject to copyright protec- of these types. However, in both types of LANs and in 
tion. The copyright owner has no objection to facsimile LANs other than those of the Ethernet or Token Ring 
reproduction by anyone of the patent diocument or the 10 type, the structure of the information packets has both a 
patent disclosure, as it appears in the Patent and Trade- source identification (Source ID) and a destination iden- 
mark Office file or records, but otherwise reserves all tification (Destination IDX so that the sender node and 
copyrights whatsoever. reccivernode can communicate and so that the receiver 

BAOCGROUND OF THE mVENTION '^'^r:£::^'^^^t-^r^ZZ 
This invention relates generally to Local Area Net- tion, although not restricted in any way to the interfac- 
work (LAN) systems, and more particularly to a system ing of two LANs (both of which are either of the Ether- 
in which two or more separate LANs may be connected net or Token Ri^g type), allows the efHcient flow of 
or **bridged** so that traffic between LANs is restricted messages from one LAN to another, as will be hereinaf- 
to those destinations which are in the remote LAN. 20 ter described. 

This results in more efHcient use of the processor con- A fundamental problem encountered by the interfac- 
trol in the bridge device and faster throughput of mes- ing of two LANs, cither locally or remotely, is that 
sages from one LAN to another. when a station or node on one of the LANs (with a 

The computer industry has recently seen the wide- certain Source ID or address) wants to send a message 
spread growth of LAN systems. A LAN is, broadly, a 25 to another node (with a certain Destination ID or ad- 
computer communications system which is shared by dress), the source node may not know whether the 
usen in a limited (hence the terminology local) locale to destination node is located on its own LAN or on the 
share information and resources. A wide area network other LAN. Thus, in certain circumstances, the bridg- 
(WAN) is, by comparison, a computer communications ing circuitry may automatically send all information 
system in which the users are dispersed over an unlim- 30 packets to the remote LAN, even if the destination node 
ited geographical area (hence the terminology wide). is not on the other LAN. This takes up valuable proces- 
LANs and WANs come in many varieties and types. A sor time in the processor contained in the bridging cir^ 
good description of LAN types and thdr respective cuitry. As a result, the throughput from one LAN to the 
functioning is given in Understanding Local Area Net- other over the link between the two LANs may de- 
works, by S. Schatt, published by Howard W; Sams & 35 crease, leading to inefficiency in the system. 
Co., 1989. WANs may take the form of a large single Other than simply ignoring the fact that it may not be 
interconnected system or be composed of a plurality of known whether the destinatiixi node is on one LAN or 
LANs connected together so as to form a WAN. It is to another, it is possible to provide a method by way of a 
the latter of these systems to which the invention of the software solution under which not all packets are sent 
present invention pertains. . 40 to the remote LAN. Under this solution, all destination 

The present invention relates to WANs composed of ID comparisons are accomplished in software, but only 
a number of interconnected LANs. These LANs may after the complete padcet is received. An address table 
already have been in existence and the user may have a is built into the software of the bridge system to learn 
need to connect them in some fashion because, for ex- the source addresses. Unfortunately, due to the nature 
ample, the company has opened a new office with its 45 of the low level Token Ring protocol, the Address 
own LAN system to which the home office must com- Recognized/Frame Copied indicator bits, which appear 
municate. The company has the choice of developing a at the end of the packet after the Ending Delimiter, 
new WAN system so that both offices may be con-. must be set in real time. A description of the foregoing 
nected or may utilize the existing separate LANs and structure is given in Token Ring Access Method and 
provide some means to allow the two systenos to coni- 50 Physical Layer Specifications, published by the IEEE, 
miunicate. Also, the company may have a preference for 1985, pages 27 39, as well as in the Understanding Local 
a particular type or brand of LAN and a wish to remain Area Networks publication referred to above, especially 
with that system. Additionally, the needs of a particular at pages 46-47 and 79^106. The node must know that 
user will depend on the functionality demanded. For the packet was addressed to the node and that there is a 
example, a user may merely have a need to transfer 55 buffer available. This is easily accompli^ed by the 
batch information from on LAN to another on an occa- Token Ring Controller IC in a single address environ- 
sional basis. On the other hand, the user may have a ment (such as a workstation or server), but in a txanspar- 
need to have a continual transfer of information be- ent bridge environment, where every packet must be 
twe«i the LANs and the need to have this transfer take looked at» this can only be accomplished in hardware. It 
place seamlessly. It is to the latter functionality that the 60 can be ignored by the transparent bridge designer be- 
device of the present invention pertains. Each of the cause the bw levd protocol is ignored by some work- 
LANs tisually contains a plurality of stations or nodes, station/server software developers, but there are those 
which can both receive and transmit information to packages that do not ignore the protocol and will indi- 
other stations and nodes. cate fault on the network if the bits are not set properly. 

Two of the more popular LAN systems commer- 65 It is therefore a principal object of the present inven- 
cially available today are the Ethernet and Token Ring tion to provide an apparatus and method for learning 
LAN systems. These LANs are described and specified, and filtering destination and source addresses in a local 
respectively, by IEEE Recommendations 802.3 and area network system. 
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It is another object of the present invention to im- lion capable of comparing large numbers of addresses in 

prove throughput for communications from one LAN real time. As an alternative to the CAM devices. A 

to another LAN over the device which embodies the memory array (48-bits wide) could be used, but the 

present invention by using the processor only for infor- sequential search time would limit the number of ad- 

mation packets which will be forwarded to the remote 5 dresses that could be compared before the end of the 

LAN. packet is reached. 

It is yet another object of this invention to facUitate The present invention is intended to be used in a 

communication between two LANs of the same type bridge requiring the characteristics of transparent 

remotely located regardless of whether the LANs are of bridging, wherein the existence and interconnection of 

the Ethernet or the Token Rmg variety. 10 a pluraUty of LANs is not apparent or is otherwise 

SUMMARY OF THE INVENTION transparent to the user. This could be a bridge devoted 

^ ^ . . ^ to transparent bridging, or a bridge that combines trans- 

In accordance with the present mvention, when parent bridging with source routing. The present inven- 

bndpng Aita between two or more Local Area Net- tj^n can also be used in a mode to provide specific 

^°5^.<^^> '^!^ ""^^ ^ destination address filtering If the source address Iton- 

«nbodimcnt of the pi^t myentton wUl le^ the sta- ^ ^ ^^^^ ^^^^ ^^^^ ^ ^^^^ ^ ^ 

tion addresses of the nodes attached to the bndge, and microprocessor 

X^tLi^bl'i^^^ The preJE^tion is described in the specification 

i^!I^^^^.n i™^^ w>,ioh ,n ^ anIEEE 802.5 Mbps Token Ring envir^l^t and 

nJ^^TTfn^^^ ^ an IEEE 802.3 Ethernet lObaseS, l(lase2 and lObaseT 

chip interface, md one or more content addressable ^K,^^^'^. ^oken Rmg enviromnent or a 100 Mbps 

memories. The state machine monitors the LAN con- ™^ enym)nment. The mvention. However is not 

troDer chip to determine when a new packet is bemg 25 ^T^^ T V^^'* ^f" ^ t^^'^ 

received from the LAN. It then monitors the LAN Ip^^. ^ P^*^^^ ^^^^ spec- 

controller chip interface to evaluate the destination and "ications. ^« . 

source addresses. The source address is compared to a Appendix A is a hstmg of Boolean equations and state 

table of previously learned source addresses and added »achme descnption language utilized for carrying out 

to the table if it b not already in the table. The table of 30 apparatus and method of the present invention, 

source addresses is compared to the incoming destina- BRIEF DESCRIPTION OF THE DRAWINGS 
tion address. If the destination address is found in the 

table of source addresses, then the packet is being sent ^^^^ diagram of a representative portion 

to a node on the local LAN and should not be sent of two LANs with a remote bridge providing connec- 

through the bridge. If the destination address is not 35 between them. 

found in the table, then the packet should be forwarded 2j is a diagram of a known Ethernet/IEEE 

over the link to the other LAN. ^^-^ Pac^^et structure. 

There are various han^ishflW^ bits between the state ^ »s a diagram of a known Token Ring/IEEE 

machine and the microprocessor mterface. One of the ^ packet structure. 

bits insures that the microprocessor reads the learned 40 ^^G- ^ is a block diagram of a LAN bridge station, 

source address before another source address can be FIG. 4 is a block diagram of the LAN source address 

learned. Another insures that the microprocessor does learning/destination address filtering apparatus, 

not access the table of source addresses unless the state F^G* diagram of the LAN source address 

machine is in a "holding" loop. The table of source learning/destination address filtering apparatus state 

addresses has a finite size. There is an indication to the 45 niachine. 

microprocessor that indicates when the table is full. I^G- ^ is an exploded view of the state machine used 
This invention improves known technology by al- to implement the embodiment of FIG. 5 showing all 
lowing the low-Ievd Token Ring protocol Address sampled inputs and controlled outputs. 
Recognized Indicator (ARI) and Frame Copied Indica- FIG. 7 is a schematic diagram of the start of destina- 
tor (FCI) bits, found in the Frame Status (FS) byte that 50 tion address detector shown in FIG. 4. 
follows a packet, to be set in real time (as the packet is »>ct a ti t^cc^ vrtrr^^^r 
being reUyed through the LAN receiver to the LAN ^"^^"^nSi?^ 
transmitter). If these bits are not set correctly, then low rKfc^EKKED EMBODIMENT 
level software may indicate LAN errors to the node's Referring now to FIG. 1, in LAN 10, a medium 12 
host processor, which could result in the node remov- 55 interconnects a number of nodes 14 to facilitate digital 
ing itself from the ring. communication among them. Nodes may be server 
The present invention also gives the bridge, whether devices for one or more data terminals (DTEsX corn- 
it be Ethernet or Token Ring, the ability to filter all puters, etc. Different nodes can represent equipment 
LAN packets regardless of the load of data on the net- supplied by different vendors, but all nodes commimi- 
work. The filtering rate of the bridge is no longer a 60 cate via the LAN medium in accordance a packet type 
function ofthe performance ofthe microprocessor. The protocol 

microprocessor will only be given packets that need to Occasionally, it is necessary for nodes to communi- 

be forwarded to the remote LAN. cate with other nodes not directly connected to the 

The present invention is facilitated by the availability LAN medium. Of the several methods to accomplish 

of Content Addressable Memory (CAM) devices, 65 this connection, a transparent bridge 16 is shown in 

where the incoming destination address and source FIG. 1. The invention can be applied to this type of 

address can be compared to a table of previously stored bridge, whether remote (connected to another bridge 

addresses simultaneously. This makes the present inven- via a Wide Area Network link) of a local (co-located in 
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the same chassis) and can also be used in conjunction clock to the LAN Controller 46 LAN Controller 46 
with the Token Ring Source Routing protocol. may be a commercially available integrated circuit. 

Referring now to FIG. 2a, representing an Ethernet such as the AMD 7900, available from Advanced Micro 
date packet, each data packet 20 begins with a preamble Devices. In the absence of the present invention, all 
22 of 64 bits consisting of alternating Is and Os and 5 packets received by the LAN controller 46 will be . 
ending with two consecutive Is. Next a sequence of placed in the shared memory 48, requiring the micro- 
forty eight destination ID bits 24 identify the node for processor 50 to allot valuable time to looking at each 
which the packet is intended, and a sequence of 48 packet and determining whether it should pass it along 
source ID bits 26 identify the node which originated the to the remote bridge or discard the packet. In addition, 
packet* The n bytes of data 28 appear followed by four 10 and again in the absence of the present invention, in 
CRC bytes 30 at the end of the packet. The structure of some Token Ring applications the low level packet 
the Ethernet packet described herein is not a part of the protocol will be violated because the Address Recog- 
present invention, but is. a data packet which is de- nized/Frame Copied bits in the Frame Status Field 39 
scribed and specified by IEEE Recommendation 802.3. could not be set while the packet was passing through 

Referring now to FIG. 2^ representing a Token Ring 15 the LAN Contrc^ler. 
data packet, each data packet 32 begins with a Starting Referring now to FIG. 4, Uie invention facilitotes two 
Delimiter 34 of 1 byte comprised of Manchester code methods to interface with the data stream being re- 
violations. Manchester encoding is a known method to ceived from the LAN: (1) a serial path using the Re- 
encode serial data and is specified in the IEEE publica- ceive Clock 100 and Receive Data 102 signals that con- 
tion referred to above on page 73. Next, a sequence of 20 nect the Manchester encoder/decoder and the Ethernet 
two bytes comprises the Access Control/Frame Con- LAN controller chip (for Ethemet/8023 applications); 
trol fields 36. Then a sequence of forty eight destination and (2) a parallel path that allows the invention to take 
ID bits 26 identify the node which originated the data directly from the Token Ring Controller chip 
packet. The n bytes of date 28 appear followed by four parallel bus 104 after the Manchester encoder/decoder 
CRC bytes 30 at the end of the packet. The packet is 25 and the internal LAN Controller serial to parallel con- 
closed by an Ending Delimiter 38 of 1 byte comprised version (for Token Ring/802.3 applications) 
of a different Manchester code violation than the Stert- In the case of an Ethernet LAN, a separate circuit 80 
ing Delimiter 34. Following the Ending Demlimiter 38 looks for the Ethernet preamble 22. The Ethernet pre- 
and outside the CRC 30 is the Frame Status Field 39 amble consists of 64 bits of alternating ones and zeroes, 
containing the Address Recognized/Frame Copied bits. 30 the end of which is designated by two consecutive ones 
The node on the LAN can determine by the setting of OHustrated by the binary octet '1010101 V) The Start of 
France Stetus Field 39 whether the destination node is Destination Address Detector 80, shown in FIG. 4 and 
located on the LAN and whether the message was in detail in FIG. 7, uses a serial to parallel shift register 
received. The functioning and structure of the Frame 300 with the receive date 102 signal feeding the serial 
Stetus iMeld 39 is detailed in the IEEE publication re- 35 input and the receive clock 100 continuously clocking 
ferred to above, at pages 34-35. the date through shift register 300. Any time the parallel 

The structure of the Token Ring packet described output of the shift register 300 matches the fixed binary 
herein is not a pan of the present invention, but is rather pattern 10101011 (the end of-preamble octet described 
a structure which is specified and described by IEEE and specified in the IEEE 802.3 specification) of the 8 
Recommendation 802.5, contained in the IEEE publica- 40 bit equivalence comparator 302, an end of preamble 
tion referred to above. These Address Recognized/- signal 304 is asserted. This signal 304 is "^ANDed" with 
Frame Copied bits convey to the sending node whether the receive enable »gnal 104 at 306 such that the Start . 
or not the destination ID 24 was recognized by a node Stete Machine signal 110 and the Synchronize Counter 
on the LAN and whether or opt the receiving node signal 114 will assert only when a packet is being re^ 
copied the frame into its memory. The decision as to 45 ceived. 

how these bits are to be set must be rendered before the The circuit 80 generates a signal 114 that synchro- 
end of the packet is relayed through the LAN control- nizes the serial to parallel shift register 84 to the incom- 
ler IC. ing date stream. A counter 85 is started at this time to 

Referring now to FIG. 3, the transparent bridge de- determine when 16-bits of the destination ID have been 
noted as 16 contains a medium access unit 40 that re- 50 serially shifted. At the end of 16 bits the counter signals 
ceives all digital date signals that appear on the LAN the state machine 94 that the parallel registers can be 
medium and transmits onto the niedium digitel date "dumped" to the 48-bit address latch 82 within the 
signals that are generated from the bridge. Digital date content addressable noemory 86. This 48-bit latch within 
signals are carried on the medium 12 in Manchester the content addressable memory chip has to be loaded 
. encoded form. Medium access unit 40 is connected to a 55 in three consecutive 16-bit operations. In the remainder 
Manchester encoder/decoder 42 that decodes the en- of the description of the present invention, the opera- 
coded digital date received from the medium access unit tion of the invention is the same for Ethernet and Token 
40 into a stream of non Manchester encoded date and Ring The serial to parallel shifting and synchronization 
Manchester encodes streams of non Manchester en- aredoneindependently of the stete machine, so that the 
coded date generated by the node for delivery to the 60 state machine does not need to know which type of 
medium access unit 40. A commercially available inte- LAN (Ethernet or Token Ring) it is operating on. 
grated circuit available for purposes of providing the In the case of a Token Ring LAN, the Start of Desti- 
functioning of the Manchester encoder described herem nation Address Detector 80, the Serial to Parallel Con- 
is an AMD 7992 device, available from Advanced vertor 84, and the 16 bit Counter 85 are contained 
Micro Devices, 65 within the Token Ring Chip set commercially available 

The Manchester Encoder/Decoder determines that from Texas Instruments (as TMS380). The TMS380 
there is signaling.on the LAN medium. When it begins presents a parallel bus that contains the information to 
decoding the signal, it conveys the decoded date and operate the stete machine. 
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The state machine 92 controls the address storage 
latches 82 with the latch address signal 112. The state 
machine referred to generally in FIG. 4 as 92 actually is 
a representation of a state machine 94 and a control 
decode 96, described in detail below with reference to 3 
FIG. 6. When a complete address is latched into the 
address storage latches 82 of the content addressable 
memories 86, the content addressable memories auto- 
matically begin a table comparison process. One con- 
tent addressable memory wilt simultaneously compare 10 
256 48-bit data string (in this case LAN addresses) and 
report the result as a match (if the comparing string is 
found in at least one of the 256 locations) or a no match 
(if the comparing string is not found in any of the 256 
locations). One embodiment of the present invention IS 
allows for up to eight content addressable memories to 
be used in parallel operation for a total of up to 2048 
(256 times 8) 48 bit LAN addresses. Addresses can be 
stored in the content addressable memories by one of 
two methods: (1) the microprocessor 50 (which may be 20 
a M68000 scries microprocessor available from Motor- 
ola) can directly enter the LAN addresses (which can 
be either learned from packets received or from a pre- 
determined table entered by a user of the LAN bridge), 
or (2). the state machine can learn the source addresses 2S 
26 from the incoming packets as described below. When 
a comparison to the address table 90 is completed, the 
address compare and store lo^c 88 signals the state 
machine 92 with the address-in-table 118 signal. The 
state machine 92 signals the LAN controller to discard 30 
a packet by asserting the reject packet 108 signal. In the 
case of an Ethernet LAN, this signal is '"ORed** with 
the collision signal from the Medium Access Unit 40, to 
force the Ethernet LAN Controller 46 to discard the 
packet. In the case of Token Ring, this signal is con- 33 
nected to the XFAIL signal of the Token Ring Chip set 
(Texas Instruments TMS380), and its inverse is con- 
nected to the XMATCH signal. The assertion of 
XFAIL will cause the Token Ring Chip set to discard 
the packet unless the packet is determined by the Token 40 
Ring Chip set to be addressed to the bridge directly. 

Referring now to FIGS. 4 and 5, when no packets are 
being received from the Manchester encoder/decoder 
42, the state machine is idle at step 200. The state ma- 
chine remains idling while waiting for the start of the 45 
packet at 230. As soon as the start of a packet is indi- 
cated to the apparatus of FIG. 4 at step 232, the start of 
destination address detector 80 begins looking for the 
destination ID 24 (shown in FIGS. 2a and 2b) in the 
data and the state machine waits at 202 for an indication 50 
of the start of the destination address at 234. When the 
destination address detector 9Q determines that the des- 
tination ID 24 is about to start, the detector 80 signals 
this to the state machine 92 by asserting the start state 
machine signal 110 at 236. This places the state machine 55 
92 in the capture destination address mode at step 204. 

The state machine 92 then c^tures the destination ID 
24 in the address storage latches 82. Alter the destina- 
tion ID 24 is captured completely at 238 and completely 
stored at 240, the content addressable memories 86 60 
compare at step 206 the destination ID 24 with the 
address table 90 of previously stored source IDs 26 
(shown in FIGS. 2a and 2^). At the present time content 
addressable memories are available commercially, such 
as the AMD 99C10 integrated circuit available from 63 
Advanced Micro Devices, in size of 256 address capac- 
ity. These CAMs may be ganged together to allow 512, 
1024, etc. of addressable memory. The destination ID match- 
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comparison is completed before the source ID is cap- 
tured. The result of the comparison is conveyed to the 
state machine by the address in table signal 118. The 
state f this signal is stored in step 208 by the state 
machine 92. After the comparison of the destination ID 
24, the state machine enters the capture source destina- 
tion mode in step 210. When the complete source ID 26 
is captured in 242 and completed at 244, the state ma- 
chine 92 causes the content addressable memories to 
compare in step 212 the stored source ID 26 to the 
address table 90 of previously stored source IDs. When 
the comparison is complete* the sute machine 92 checks 
the state of the address-in-table signal 118. If this signal 
is active such that the source address is found in the 
table, then the address has been stored previously as 
denoted at 248. If this signal is not active such that the 
source address b not in the table at 246, then the state 
machine will enter the new source ID into the address 
table 90 by sequencing through the add source address 
to table mode in step 214. The stored stete of the address 
in table signal 118» in step 216 is then used to signal to 
the LAN controller to either reject the packet at 220 if 
the destination address is found in the table at 252 using 
the reject packet signal 108 or pass the packet to the 
microprocessor in step 218 if the destination address is 
not found in the table denoted at 250. In the case of 
Ethernet, this signal is **ORed*' with the collision signal 
from the Medium Access Unit 40 to force the Ethernet 
LAN Controller 46 to discard the packet. In the case of 
Token Ring, signal 108 is connected to the XFAIL 
signal of the Token Ring Chip set (Texas Instruments 
TMS380), and its inverse is connected to the XMATCH 
signal. The assertion of XFAIL will cause the Token 
Ring Chip set to discard the packet unless the packet is 
detennined by the Token Ring Chip set to be addressed 
to the bridge directly. The state machine then enters the 
wait for end of packet mode denoted at 254 and waits 
for the end of the packet at 256 in step 222 before re- 
turning to the idle state m step 200. 

Referring now to FIG. 6, the implementation of the 
Token Ring versbn of FIG. 5 uses groups of signals 
that are represented as a single signal in FIG. 5. In FIG. 
6, it b seen that the state machine 92 corresponds to the 
state machine 92 of FIG. 4, and b comprised of the state 
machine 94, and state decode 96, which may be, as 
noted above, programmable logic arrays. The meaning 
and significance of the various signals represented in 
FIG. 6 are given by the Table below: 

TABLE OF SIGNAL NAME ABBREVIATIONS FOR HO. 6 
PKTCNT 



PE 



PscKet Count 



Pkrand loul 
Enable 



CMDCAMLT CoMnunD 
CAMUTcfa 



CMDCAM CoMmuD CAM 

PASS- PASSCoNTRoL 
CNTRL 



MATCH LATCH 



Incrementt a 16-bit coanter 
to keep track of packets 
received from the LAN 
Altowj the address on the 
Token Ring Chip set 
address/daU bus to be 
latched into a set of 
incrementing counters. 
When the CAM command 
register is read this signal 
win latch the address of the 
fbst empty CAM location. 
This KgtaH enables a CAM 
command to be written to 
the CAM control register. 
This signal asxru near 
the end of the sute machine 
cycle to cause the LAN 
controller chip to pass or 
reject the current packet. 
This signal latches the state 
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-continued 



LATCH 



STRIP 



RDS 



WRX 



CSX 



68KEN 



ENCMUX 



TMSLTCH 



SALWRA 
SALWRB 



SAL WREN 



WRALL 
CSALL 



NOPASS 
68KACC 
CEP 



STRIP 



Read one CAM 



Write one CAM 



Chip Select one 
CAM 



68K£NabIe 



ENCode 
MUtipleXd 



TM380UTCH 



Source Address 
Learned 
WRite A/B 



Source Address 
Learned ENable 



WRite ALL 
Chip Select ALL 



NOPASS 
68K ACCeu 



Count Enable 
Pardlel 



CLRTSA 



CLeaR Transmit 

Strip 



of the "ANDed** MATCH 
signals from the CAMs after 
the CAMs have been loaded 
with the destination address. 
This signal is used when the 
state machine detects an 
automatic reject, such as a 
Token Ripg MAC level 
frame. 

This signal is used to single 
out a single CAM in a multi- 
ple CAM system to be read 
from. 

This signal b used to shigle 
out a ungle CAM m a molti- 
pie CAM system to be 
written to. 

This signal is used to smgle 
out 4 single CAM in a multi- 
ple CAM systqn to be read 
or written. 

This signal is asserted in 
response to a SLVHOLD 
ngnal assertion when the 
state madnne is in the idle 
state and the microprocessor 
requests access to the 
CAMS. 

This signal latches the num- 
ber of the CAM (in a 
multiple CAM system) 
where a newly learned 
source address has been 
stored. 

This signal is synchroniied 
with the demultiplexing of 
the Token Ring Chip set 
iddress/data bus. 
These signals control the 
addressing of a set of 
register files thai store the 
oewYy learned 48-bit source 
address. 

This signal controls the 
wriUng of the newly learned 
48-bit source address to a set 
of register filesi that can 
subsequently be read by the 
niicrt^tfoccssor. 
These signals are asserted to 
cause a simultaneous write to 
all of the CAMS. TTiey are 
used to write the source aitd 
destination addresses to all 
of the CAMs (in a multiple 
CAM system) at the same 
time. 

This signal has an identical 
function to STRIP. 
This signal has an identical 
function to 68KEN. 
This signal allows the I64}it 
counter that contains the 
address latched from the 
Token Ring Chip set 
address/data bus (by the 
signal PE described above) 
to increment to the next 
sequential address, which 
will indicate where the next 
received 16-bits of the 
incoming packet will be 
stored by the Token Ring 
Chip set. 

This signal clears the 
register that holds the source 
address of a packet that the 
Token Ring Chip set has 
queued for transmit This 
prevents the state machine 
from learning a packet sent 
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TABLE OF SIGNAL NAME ABBREVIATIONS FOR FIG. 6 

around the ring by this node 
(the bridge). 

CMDCDl 2 CoMmand CAM These signals are used to 
CMDCD14 Data change bits on the data bos 

bit 12/14 (through an octal buffer 

CDBUFEN CAM Data bus controlled by the 

BUFfer ENable CMDCAM signal described 
above) when the state 
machine needs to reset the 
CAM segment counter bits 
or needs to write the 
source address into the CAM 
memory or the microproces- 
sor dau bus bufTers have 
access to the CAM daU bus. 
XMATCH external This signal is bi-directional 

MATCH from the Token Ring Chip 

set As an inputs when both it 
and the XFAIL lines are 
active, the chip set is signal- 
ling the beginning of anew 
receive packet As an output, 
it indicates to the chip set 
that it should pass the packet 
to the microprocessor. 
XFAIL cXtcmal FAIL TOs signal is bi-directional 

from the Token Ring Chip 
set As an input, when both it 
and the XMATCH lines are 
active, the chip set is signal- 
ling the beginning of a new 
receive packet. As an output 
it indicates to the chip set 
that it should reject the 
packet currently being 
received, without infonning 
the microprocessor. 
LEARN- LEARN ONLY This signal is set by the 

ONLY microprocessor to allow the 

state machine to learn the 
sotirce addresses of incoming 
packets and reject them 
regardless of the destinatioa 
address. 

ADD- ADDress These signals are the equiva- 

40 MATCHO MATCH 0/1 lence outputs of standard 
ADD- octal comparators that 

MATCH I compare the current address 

on the Token Ring chip set 
bus with the stored and 
incremented address latched 
by the PE sgnal 
(described above). 
HNDSHK HaNDSHake This signal faidicaies to the 
Slate machine that the 
microprocessor has read the 
previously leamed source 
address. If this Ui is not 
active, then the state 
machine will proceed to 
learn a new source address. 
If this bit is active, then the 
state madiitte wfl] not learn 
any other source addresses, 
but it will still fOter based on 
the destination address. 
ALLFULL ALLFULL This signal indicates that the 

CAM(s) are fun and that no 
more new source addresses 
can be stored in the CAM(s). 
60 TSAMATCH Transmit Strip This signal is the equivalence 
Address MATCH output of a standard 

comparator that compares 
the packet currently being 
received with the source 
address of the packet most 
65 recently transmitted. This 

prevents the sute machine 
from learning the source 
- address of the packet it Just 
transmhted. 



20 



25 



30 



35 



45 



50 



55 



06/25/2003, EAST Version: 1.03.0002 



11 

•continued 



5,136.580 



12 



TABLE OF SIGNAL NAME ABBREVIATIONS FOR FIO. 6 



MATCHANY 


MATCH ANY 


This signal fe the ^'ORed*' 
signal of all of the CAMs 
MATCH signals that indicate 
that a match for the 
destination or the source 
address was found one of the 
CAMs. 


SLVHOLD 


SLaVcHOLD 


This signal is a request from 
the oiicroprocessor to place 
the state machine in a 

noioing loop 90 mai mc 
microprocessor can access 
the CAMS without 
intcrruption. 


MAC 


MAC 


This signal is generated 
when the AC/FC portion of 
the Token Ring packet is 
present on the Token Ring 
Chip set data bus. 
It is used by the sUte 
machine to reject MAC 
frames. 
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In operation, the Start of Packet 230 is signified to the 
SUte machine 5^ when both XFAIL and XMATCH are 
asserted. The state decoder 96 de-asserts P£ and asserts 
CEP to capture the TMS380 bus address of the packet 
as it is being stored t(^memory. The state machine then 
enters the Wait for Destination Address loop in step 
20Z When the next sequential address appears on the 
TMS3870,memory bus, the two signal ADDMATCHO ^ 
and ADDMATCHl will assert. The state decoder as- 
serts WRALL to the content addressable memory(ies) 
so they can latch a 16 bit piece of the 48 bit destination 
address. Then CEP is asserted to allow the next sequen- 
tial TMS380 memory address to be located. This cycle 
occurs two more times, allowing the capture of all 48 
bits of the destination address. 

After the third write to the content addressable mem- 
ory(ies) they will automatically begin the Compare ^ 
Destination Address cycle in step 206. Then the state 
machine will advance to the Save Result cycle in step 
208 and the state decode will assert the MAT- 
CHLATCH signal to take a snapshot of the result of the 
comparison and write that result to a storage element . . 
501 for later use. MATCHLATCH is used as the dock 
input to a D type flip flop, the standard storage element 
denoted at 501 as MATCH STORAGE. The MATCH- 
ANY signal, also known as the Address in Table de- 
noted at 118 in FIG. 4, will be asserted by any content ^ 
addressable memory that locates the 48-bit destination 
address in its table. The MATCHANY signal is used as 
the "D'* (or dau) input to the storage element or flip 
flop 501. The **Q*' or output of the element or flip-flop 
501 is sent to the RejeclTPass control element or de- 
coder 503. 

The state machine checks the state of the ALLFULL 
signal to determine if there b space available in any of 
the content addressable memory(ies) in the case the 
source address (which will be captured next) needs to 
be stored in the content addressable memory. It also 
checks the HNDSHK signal to determine if the micro- 
processor 50 has read a previously learned source ad- 
dress. If either of these signals is asserted then the state 
machine skips the source address portion of the state 55 
machine (in steps 210, 212 and 214) and proceeds to 
recall the result of the Destination Address comparison 
216. 



If both ALLFULL and HNDSHK are de asserted, 
the state machine continues to look for data being 
stored in sequential memory locations on the TMS380 
memory bus. It captures the three 16 bit pieces of the 
Source Address in the content addressable memory(ies) 
address latch by asserting WRALL and CEP in the 
same manner as described previously. The state decode 
also asserts SALWRA, SALWRB, and SALWREN to 
capture the source address to a 48-bit register that can 
be read by the microprocessor if the address is added to 
the content addressable memory address table 90. When 
the 48-bit address is complete, the content addressable 
memory(ies) automatically compare the address to the 
addresses previously stored in the address table 90 (as 
internal structure of the content addressable memory). 
The state machine advances to the Compare Source 
Address cycle in step 212. 

If the content addressable memoryOes) finds a match, 
the MATCHANY signal wiU be asserted to the state 
machine 94 and to the storage latch 501 denoted as 
MATCH STORAGE in FIG. 6 for later use. This sig- 
nal is coupled with tiie TSAMATCH signal which, if 
asserted, will indicate that this packet currently being 
received was transmitted by this bridge On a ring, the 
transmitter will receive all packets that it transmits and 
is responsible to strip off its transmitted packet so that it 
does not continue around the ring indefinitely). The 
TSAMATCH signal is created by comparing the re- 
ceived source address to the last queued transmit ad- 
dress read from the TMS380 memory bus. If the TSA- 
MATCH signal is asserted, then the state machine does 
not learn the received source address (if it did learn the 
address of a packet that came from another LAN via 
this bridge, then packets originated by local nodes des- 
tined for this address wiU be filtered in the future, 
thereby defeating the bridging function, since the 
learned address table is used for fDtering destination 
addresses). 

If the source address is to be added to the table, then 
the state machine enters the Add Source Address To 
Table cycle in step 214 and the state decode asserts 
RDX, WRX, and CSX to the content addressable mem- 
ory with an empty location (determined by the indtvid- 
tud FULL signals supplied by the content addressable 
mem ory fed through a priority encoder such as the 
TTL chip 74147). First the content addressable memory 
is read with the RDX and CSX signals. The first avail- 
able location is from the status register of the content 
addressable memory into an 8-bit register using the 
CMDCAMLT signal (this address indicates one of 256 
menK)ry locations). Then a command is issued to the 
content addressable memory by asserting the WRX. 
CSX, CMDCAM, and CMDCD12 signals. The state 
machine then advances to the Recall Destination Ad- 
dress Compare Resuh cycle in step 216. 

This group of sign^ consists of PASSCNTRL, 
STRIP, and NOPASS. Afl three signals control a de- 
coder 503 denoted as REJECT/PASS CONTROL in 
HG. 6 to enable either XMATCH or XFAIL (which 
have been held in a high impedance state). If 
PASSCNTRL is asserted, then the decoder 503 looks at 
the state of the signal stored by the signal MAT- 
CHLATCH back in the Save Result cycle 208. If this 
signal is asserted, then a match was found in the content 
addressable memory indicating that the address was 
learned from a local node. In this case, since filtering is 
based on the destination address, the packet currently 
being received is destined for a local node and should 



06/25/2003, EAST Version: 1.03.0002 



13 



5,136,580 



not be passed to the microprocessor 50 and XFAIL will 
be asserted. If the latched signal is de asserted, then this 
packet is bound for a non local destination and should 
be passed to the microprocessor and XMATCH will be 
asserted. If STRIP is asserted, then the state machine is 
in the idle/hold state and the microprocessor is access- 
ing the content addressable mcmory(ics) (68KEN and 
68KACC will also be asserted by the state decode). In 
this case all packets are to be discarded and XFAIL wills 
be asserted. If NOPASS is asserted, then the state ma- 
chine has been placed in a mode where is only learning 
addresses on the local ring. This is controlled by a signal 
to the state machine from the microprocessor called 
LEARN ONLY. If this agnal is asserted, then 



14 



10 



Receive ENAble). As outputs from the external de- 
coder 503 XMATCH and XFAIL are also combined to 
a single signal, called CLSN EN (for CoLiSioN EN- 
able). The ADDM ATCHO and ADDMATCHl signals 
are combined into COUNT 16 signal, which indicates 
that the next 16-bits of a destination or source address 
are available. There is no TSAMATCH signal ance in 
Ethernet the transmitter does not need to strip its trans- 
mitted packet. 

With respect to the state decode 96, only the signals 
P£ and CEP arc not used. All other signals operate in 
the same manner as for the Token Ring. 

The system of the present invention has been de- 
scribed above as being applicable to the interfacing of 



NOPASS will be asserted, which will cause XFAIL to 15 LANs of the Ethernet type and of the Token Ring type. 



be asserted. Then the state machine will advance to 
Wait For End of Packet 254, and assert the packet 
counter signal, PCKCNT, that allows the microproces- 
sor to keep track of the number of packets, whether 
they are filtered or not. 

This state machine is the same for the Ethernet case, 
except that the signals have somewhat different names, 
corresponding to the way they are produced. 
XMATCH and XFAIL as inputs to the state machine 



The system could also be used with the interfacing of 
other types of LANs, such as the types described in 
detail in the publication Understanding Local Area Net- 
works referred to above. 
20 Therefore, while the foregoing invention has been 
described with reference to its preferred embodiment, 
various alterations and modifications will occur to those 
skilled in the art. All such modifications and alterations 
are intended to fall within the scope of the appended 



are combined into a single signal called RENA (for 25 claims. 

Partno 27-260-3; * 
AaacBoly LAFS; 
Date 06/23/89; 
Bevision 2.1: 

Designer g. b. videlock/d. nadrowski; 
Company microcom, inc; 
Location 01; 
Device p22vlO:,_ 

/* Allowable Target Device Types: 22vl0-15 
/«« Inputs **/ 



Fin 
Pin 
Pin 
Pin 
*PIN 

.Pin 

'Pin 
Pin 8 
Pin 9 
Pin 10 
Pin 11 
Pin 13 



Iclk 
! xmatch 
Ixfail 
loarnonly 
SaddnatchO 
! addmatchl 
ihndshk 
Sallfull 
itsanatch 
! isatchany 
slvhold 
!ioac 



A active high to match spec */ 



/*5r Outputs 


»*/ 






Pin 23 


= !ql 


; /* 


ST A */ 


Pin 22 


= !q6 


; /* 


STB */ 


•Pin 21 


= !q9 


; /* 


STC */ 


Pin 20 


= !q2 


: /* 


STD */ 


Pin 19 


= !qO 


; /* 


STE */ 


Pin 18 


= !q5 


; /* 


STF */ 


Pin 17 


= !q4 


: /* 


sTg */ 


Pin 16 


= !q3 


; /* 


STH */ 


Pin 15 


= !q8 


; /* 


ST J */ 


Pin 14 


= !q7 


; /* 


STK */ 



/»» Declarations and Intermediate Variable Definitions **/ 



Sdefine and 
Sdefine or 
. Sdefine OS 
Sdefine off 



& 

•b'l 
•b'O 
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FIELD St = [q9. .0]; 



Sdeflne sO 
Sdefine si 
Sdeflne sla 
Sdefine sib 
Sdefine sZ 
Sdefine s3 
Sdefine s4 
Sdefine s5 
Sdefine s6 
Sdefine s7 
Sdefine s8 
Sdefine s9 
Sdefine slD 
Sdefine sll 
Sdefine sl2 
Sdefine sl3 
Sdefine sH 
Sdefine sl5 
Sdefine 3I6 
Sdefine sl7 
Sdefitt^sie 
Sdefine sl9 
Sdefine s20 
Sdefine 521 
Sdefine* s22 
Sdefine s23 
Sdefine s24 
Sdefine s25 
Sdefine s25a 
Sdefine £25b 
Sdefine s25c 
Sdefine s26 
Sdefine s27 
Sdefine s28 
Sdefine £29 
Sdefine s30 
Sdefine s31 
Sdefine s32 
Sdefine a33 
Sdefine £34 
Sdefine 535 
Sdefine 536 
Sdefine s37 
Sdefine s38 
Sdefine s39 
Sdefine 540 



'b' 
'b' 
'b' 
'b' 
•b- 
b' 



'b'OOOOOOOOOO 
'b'OOOOOOOOOl 
'b'OOOOOlOOOl 
'b'OOOOOlOlOl 
'b'lOOOOOOOOO 
'b'lOOOOOOOlO 
'b'lOOOOOOOll 
'b'lOOlOOOOOO 
'b'lOOOOOOlOO 

'looboooioi 

'1010000000 
'1000001000 
'1000001100 
■1000001010 
•1011000000 
•b'lOOOOlOOOO 
■b'10-"M0001 
'b'll.,.K:.00000 
'b'lOOOOlOlOO 
'b'lOOOOlllOO 
'b'llOlOOOOOO 
'b'lOOOlOOOOO 
•b'lOOOOOOOOl 
•b'OOOOOOOOll 
-b-OOOOlOOOlO 
•b-OOOOlOOllO 
'b'OOOOlOOlll 
'b'OOOOlOOlOl 
•b'OOOOlOOOOO 
'b'OOOOlOOOOl 
•b'OOOOlOOlOO 
'b'lOOOlOOlOO 
•b'lllOOOOOOO 
'b'lOODlOlOOO 
'b'lOOOlOllOO 
'b'llllOOOOOO 
'b'lOOOlOlOlO 
'b'lOOOODOlll 
'b'OOOOlOlOOO 
'b'OOOOOOlOOO 
'b'OOOOOOllOO 
'b"0000110000 
•b'OOOOllODOl 
•b'OOlOOOOOOO 
'b'OOOOllOlOO 
'b'OOOlOOOOOO 
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/*rst seg cnt 'B'OXX'*/^ 
/* addroatch */ 

/* addroatch */ 

/* addraatch */ 

/* addmatch */ 
/* addmatch ♦/ 
/* addraatch */ 
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/* addmatch */ 
/* addraatch */ 



/* new addmatch */ 
/* new addmatch */ 



/*.* Logic Equations **/ 



qO. ar 




learnonly and slvhold: 


ql. ar 




learnonly and slvhold: 


q2. ar 




learnonly and slvhold; 


q3. ar 




learnonly and slvhold; 


q4. ar 




learnonly and slvhold: 


q5. ar 




learnonly and slvhold: 


q6. ar 




learnonly and slvhold; 


q7. ar 




learnonly and slvhold: 


q8. er 




learnonly and slvhold: 


q9. ar 




learnonly and slvhold; 


qO.sp 




off; 


ql^sp 




off; 


q2.sp 




off: 


q3.sp 




off: 


q4.sp 




off; 


qS.sp 




off; 


q6.5P 




off; 


q7.sp 




off; 


qS. sp 




off; 


q9.sp 




off; 



sequence st { 
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present sO. 

• if xniatch & xfail 
if slvhold 
default 

present s 1 

if - mac 
default 

present a la 

default 

present sib 

default 

present s2 

if iaddmatchO or iaddmatchl 
default 

present s3 

present s4 

present s5 

if !addmatchO or Iaddmatchl 
default 

present z6 

present s7 

present s8 

if laddmatchO or iaddmatchl 
default 

present s9 
present slO 



present sll 

if allfull or hndshk 
default 

present sl2 

if laddmatchO or iaddmatchl 
default 

present 5l3 

.if tsaroatch 
• default 

present s 14 



present sl5 

if laddmatchO or Iaddmatchl 
default 

present sl6 
present sl7 



present sl8 

if iaddmatchO or Iaddmatchl 
default 

present s 19 
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* next si; 
next s36; 
neiet sO; 



next s40; 
next sla; 



next sib; 
next s2; 



next s2; 
next 33; 



next s4: 
next s5; 



next s6; 
next s6; 



next s7; 
next s8; 



next sB; 
next s9; 



next siO: 
next sll; 



next s33 ; 
next sl2: 



next sl2; 
next sl3; 



next s26; 
next sl4; 



next sl5; 



next sl5; 
next 6 16: 



next sl7; 
next slS; 



next sl8; 
next sl9; 



next s20; 
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present s20 

present s21 

If natchany 
default 

present s22 
present s23 
present s24 
present s2& 
present s25a 
present s25b 
present s25c 
present s26 



present s27 

if laddraatchO or laddmatchl 
default 

present s28 

if itsaaatch 
default 

present s29 



present s30 

' if iaddnatchO or iaddmatchl 
; default 

present s31 

if Itsaraatch 
default 

present s32 



present 333 

if learnonly 
default 

present s34 

present s35 

present s36 

if Islvhold ar-^ ( ixroatch or Ixiail) 

if xmatcb ar?:--=i:«f ail 

default 

present s37 

default 

present .s38 

if iaddoatchO or Iaddmatchl 
default 

present s39 



20 

next s21: 



next s33; 
next s22; 



next s23; 
next s24; 
next s25; 
next 525a; 
next s25b; 
next s25c; 
next s33; 
next s27; 



next s27; 
next s23; 



next sl7; 
next s29; 



next s30; 



next s30; 
next s31: 



next s20; 
next s32: 



next s35'. 



next s35; 
next 534; 



next sO; 
next sO; 



next sU: 
next s37; 
next 536; 



next s38; 



next s38; 
next 539: 



next s36; 
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present s40 

if faddmatchO or 
default 

} 



5,136,580 



laddmatchl 
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next s40; 
next s35; 



/* Allowable Target Device Types: 20L10-25 »/ 



/** Inputs **/ 



Pin 
Fin 
Pin 
Pin 
Pin 
Pin 
Pin 
Pin 
Pin 9 
Pin 10 
Pin 11 



Iclk 

.!qi 

iq6. 

!q9 

!q2 

!qO 

!q5 

!q4 

!q3 

!q8 

!q7 



/* ST A »/ 
A STB */ 
STC */ 
STD */ 
STE */ 
STF */ 
STG ♦/ 
5TH */ 
ST J */ 
STK */ 



/* 
/* 
/* 
/* 
/* 
/* 
/» 
/* 



/** Outputs **/ 



Pin 14 
Pin 15 
Pin 16 
Pin 17 
Pin 18 
Pin 19 
Pin 20 
Pin 21 
Pin 22 
Pin 23 



! pckcnt 
!pe 

cmdcamlt 
! cttdcam 
ipasscntrl 
imatchlatch 
!strip 
!rdx 
!wrx 
!csx 



/** Declarations and Intermediate Variable I>efinltions **/ 



Sdefine and 
$def ine or 
Sdefine ON 
Sdefine off 



FIELD st = [qS. .03; 



sdefine 
Sdefine 
sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 



sO 

si 

sla 

sib 

s2 

s3 

s4 

s5 

s6 

s7 

s8 

s9 

slO 

sll 

sl2 

3l3 
6l4 
sl5 
sl6 

5l7 

slB 
sl9 
s20 
s21 

s22 

524 
s25 
525a 
s25b 



'b'OOOOOOOOOO 

'b'OOOOOOOOOl 

'b'OOOOOlOOOl 

•b'OOOOOlOlOl 

'b'lOOOOOOOOO: 

'b'lOObOOOOlO 

'bnoooooooii 

'b'lOOlOOOOOO 
'b'lOOODOOlOO 
'b'lOOOOOOlOl 
-b'lOlOOOOOOO 
'b'lOOOOOlOOO 
'b'lOOOOOllOO 
'b-lOOOOOlOlO 
•b'lOllOOOOOO 
•b'lOOOOlOOOO 
-b'10'«>^10001 
'b-l .^ "DOOOOO 
•b'lOuoOlOlOO 
'b'lOOOOlllOO 
. 'b'llOlOOOOOO 
' 'b'lOOOlOOOOO 
•b'lOOOOOOOOl 
•b'OOOOlOOOll 
•b'OOOOlOOOlO 
'b'OOOOlOOllO 
•b'OOOOlOOlll 
'b'OOOOlOOlOl 
'b'OOOb 100000 
•b'OOOOlOOOOl 



/* rst seg cnt */ 
/* addroatch 

/* addraatch */ 

/* addmatcA' */ 

/♦ addmatch */ 
/* addmatch */ 
/* addmatch */ 
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Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 
Sdefine 



s25c 

s26 

s27 

s28 

529 

s30 

s3I 

s32 

s33 

s34 

s35 

s36 

s37 

s38 

s39 

s40 



23 

'b'OOOOlOOlOO 
'b-1000100100 
'b'lllOOOOOOO 
•b'lOOOlOXOOO 
'b'lOOOlOllOO 
'b'llllOOOOOO 

'b'loooioidio 

'b'lOOOOOOlll 
'b'OOOOlOlOOO 
'b'OOOOOOlOOO 
'b'OOOOOOllOO 
'b'OOOOUOOOO 
'b'OOOOllOOOl 
'b'OOlOOOOOOO 
•b'OOOOllOlOO 
'b'OOOlOOOOOO 



Logic Equations **/ 
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/* addroatch */ 



/* addmatch */ 
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/* new addraatch */ 
/* new addroatch */ 



pckcnt = 5t:s34 It st* s35 P st:s39; 
pe = St: sO » £t: s36; 
icrodcamlt = st:s22; 



cn«ican = !q8 & !q8 & !q7 & !q6 & iqS & !q3 & !ql & qO/»st:sl P st:EU C strslb*/ 
I s?fs40: ^ ^ '"^ * «l2/»st:s23«st:s24»st:525;:£25c ./ 



passcntrl = st:s34; 
roatchlat.ch = st:sll; 
strip =• St: s39; 
rdx = st:s22: 



wrx = st:sl # sf.sla « st:s24 <t st: s25 P st:340; 
CSX = st:sl n st:sla « strslb 

# !q9 & !q8 & !q7 & !q6 & q5 & ! q4 ; /«s22tf523«524«s'25l»s25a«s25bas25c:: 

s33#s40*/ 

/* Allowable Target Device Tj^es: 20L8-15 »/ 
/«« Inputs »«/ 



Pin 
Pin 
Pin 
Pin 
Pin 
Pin 
Pin 
Pin 8 
Pin 9 
Pin 
Pin 
Pin 



10 
11 
14 



Iclk 
!ql 
!q6 
!q9 
lq2 
!qO 
!q5 
!q4 
!q3 
!q8 
!q7 

Ibclkl 



/* STA »/ 

/* STB */ 

A STC */ 

/* STD */ 

/* STE */ 

A STF */ 

/* STG */ 

/* STH */ 

/* STJ */ 

/* STK */ 



/** Outputs 

Pin 15 
Pin 16 
Pin 17 
Pin 18 
Pin 19 
Pin 20 
Pin 21 
Pin 22 



!68ken 
! encmux 
itmsltch 
: !salwrb 
i.salwra 
jsalwren 
: !wrall 
: Icsall 



/»* Declarations and Intermediate Variable Definitions / 
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Sdefine 


and 


fit 


Sdef ine 


or 


tt 


Sdefine 


ON 


'b' 1 


Sdefine. OFF 


'b'O 


FIELD St = [q9. 


. 0]; 


Sdefine 


sO 


'b'OOOOOOOOOO 


Sdefine 


si 


'b'OOOOOOOOOl 


Sdefine 


sla 


'b'OOOOOlOOOl 


Sdefine 


sib 


"b'OOOOOlOlOl 


Sdefine 


s2 


'b' lOOOQOQOOO 


Sdefine 


s3 


'b' lOOOQOOOlO 


Sdefine 


s4 


'b' 1000000011 


«uex me 


s9 


'b' 1001000000 


saezine 


SO 


'b' 1000000100 


Sdex xne 


s / 




Sdefine 


s8 


D IU1.DUU000U 


Sdefine 


s9 


b 1000001000 


Sdefine 


slO 


b 1000001100 


Sdefine 


sll 


'b'lOOOOOlOlO 


Sdefine 


sl2 


'b'lOllOOOOOO 


Sdefine 


sl3 


b^uouuiuouo 


Sdefine 


8 14 


'J..* 1)000 1000 1 


Sdefine 


815 


vr:ioooooooo 


Sdefine 


sl6 


'b'lOOOOlOlOO 


Sdefine 


517 


'b'lOOOOlllOO 


Sdefine 


sl6 


'b* 1101000000 


Sdef ine 


819 


'b'lOOOlOOOOO 


Sdefine 


520 


•b" 1000000001 


Sdefine 


521 


'b'OOOOOOOOll 


Sdefine 


822 


'b'OOOOlOOOlO 


Sdoflne=-s23 -*"■ 


■ b'oooDioano 


Sdefine 


s24 


'b'OOOOlOOlll 


Sdefine 


825 


'b'OOOOlOOlOi 


Sdefine 


s25& 


'b'OOOOlOOOOO 


Sdefine 


s25b 


'b'OOOOlOOOOl 


Sdefine 


s25c 


'b'OOOOlOOlOO 


Sdefine 


s26 


'b'lOOOlOOlOO 


Sdefine 


a27 


'b'lliOOOOOOO 


Sdefine 


526 


'b' 1000 101000 


Sdefine 


s29 


'b'lOOOlOllOO 


Sdefine 


s30 


'b'llllOOOOOO 


Sdefine 


831 


'b'lOOOlOlOlO 


Sdefine 


s32 


'b'lOOOOOOlll 


Sdefine 


b33 


'b'OOOOlOlOOO 


Sdefine 


s34 


'b'OOOOOOlOOO 


Sdefine 


835 


'b'OOOOOOllOO 


Sdefine 


s36 


'b'OOOOllOOOO 


Sdefine 


537 


'b'OOOOllOOOl 


Sdefine 


536 


'b'OOlOOOOOOO 


Sdefine 


s39 


•b'OOOOllOlOO 


Sdefine 


s40 


'b'OOOlOOOOOO 
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/* rst seg cnt */ 
/* addroatch */'* 

/* addroatch */ 

/« addmatcK'*/ 

/* addmatch */ 
/* addmatch */ 
/* addmatch »/ 



/* addmatch */ 
/* addmatch */ 



/* new addmatch */ 
/* new addmatch */ 



/** Logic Equations **/ 

68ken = St:836 9 st:537 « st:538 J( 5t:s39; 

encmux = st:823; 



trosltch = 
salvrb = 



Ibclkl: 

5t:5l6 tt 
St: 532; 



st;5l7 n St: 519 « st:620 » st: 528 * 5t:529 «* st:s31 



salwra = str sie K st'-sl? <l5t:s22 » st: 323 « st:s24 « st:s28 » st:E29; 
salwren : !lclk tt 

(st:sl4 
Ht st:sl7 
n st:s20 
; « st:s22 

« st:s2B 
« st:s29); 



wrall 



at:sl 
st:sla 
It st:s4 
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St: s7 

st:slO 

st:5l4 

st:sl7 

St: 320 

St: 526 

st:s29 

5£:532 

st:s37 

st:s40; 
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csall.= o9 « st:sl ft at:sla *» st:slb « at:s21 » st:s40; 

/* Allowable Target Device Types: 20L8-1S */ 



/** Inputs **/ 



Pin 
Pin 
Pm 
Pin 
Pin 
Pin 
Pin 
Pin 8 
Pin 9 
Pin 10 
Pin 11 
Pin 14 



1 
2 
3 
4 
5 
6 
7 



Iclk 
!ql 

iq9 

!<l2 
!qO 

!q5 

!q4 

iq3 

!q8 

!qT 

islvhold 



/* STA */ 
/» STB */ 
/* STC */ 
/* STD */ 
/» STE */ 
/* STF */ 
/* STG */ 
/* STH */ 
/* ST J */ 
/X STK 



/** Outputs **/ 



Pin 
Pin 
Pin 
Pin 
Pin 
Pin 20 
Pin 21 
Pin 22 



15 
16 
17 
18 
19 



InopasE 
!68kacc 
cep^inv 
!cep 
'.clrtsa 
!cmdcdl2 
!cmdcdl4 
!cdbuf en 



/** Declarations and Intermediate Variable Definitions **/ 



Sdefine and 
Sdefine or 
Sdefine OH 
Sdefine OFF 



'b'O 



FIELD £t"= [q9. .03; 



Sdefine sO 
Sdefine si 
Sdefine sla 
Sdefine sib 
Sdefine s2 
Sdefine s3 
Sdefine 54 
Sdefine s5 
Sdefine s6 
Sdefine s7 
Sdefine s8 
Sdefine £9 
Sdefine slO 
Sdefine sll 
Sdefine sl2 
Sdefine sl3< 
Sdefine sl4 
Sdefine sl5 
Sdefine sl6 
Sdefine e17 
$:':efine sl8 
Sdefiue slS 
Sdefine s20 



'b*0000000000 
•b'OOOOOOOOOl 
"b'OOOOOlOOOl 
'b'OOOOOlOlOl 
'b'lOOOOOOOOO 
*b' 1000000010 
•b'lOOOOOOOll 
'b' 1001000000 
'b'lOOOOOOlOO 
•b'lOOOOOOlOl 
'b'lOlOOOOOOO 
'b'lOOOOOlOOO 
'b'100l)001100 
'b'lOOOOOlOlO 
'b'lOllOOOOOO 
'b'lOOOOlOOOO 
'b'lOOOOlOOOl 
'b'llOOOOOOOO 
'b'lOCiJ^lOlOO 
•b'ltf i»11100 
'b'll^J-^OOOOO 
'b'lOOOlOOOOO 
'b'lOOOOOOOOl 



/* rst seg cnt */ 
/» addroatch */ . . 

/* addmatch «/ 

/* addroatch »/ 

/* addroatch */ 

/* addroatch */ 
addroatch */ 
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$d6f ine 


s21 


b 0000100011 




$def ine 


s22 


D 0000100010 




Sdefine 


s23 


D OOOOlOOllO 




Sdef ine 


s24 


b OOQOlOOlll 




Sdefine 


s25 


b 0000100101 




Sdefine 


825a 


b 0000100000 




Sdefine 


925b 


'b'OOOOlOOOOl 




Sdefine 


s25c 


b 0000100100 




Sdefine 


s26 


'b'lOOOlOOlOO 




Sdefine 


s27 


'b'lllOOOOOOO 


/* addmatch */ 


Sdefine 


828 


"b'lOOOlOlOOO 




Sdefine 


529 


'b'lOOOlOllOO 




Sdefine. 


s30 


'b' 1111000000 


/* addmatch 


Sdefine 


s31 


'b'lbOOlOlOlO 


Sdefine 


s32 


'b'lOOOOOOlll 




Sdefine 


s Jo 






Sdefine 


s34 


'b'OOOOOOlOOO 




Sdefine 


s35 


•b'OOOOOOllOO 




Sdefine 


836 


'b'OOOOllOOOO 




Sdefine 


s37 


'b'OdOOllOOOl 




Sdefine 


836 


'b'OOlOOOOOOO 


/* new addmatch */ 


Sdefine 


539 


'b'OOOOllOlOO 


Sdefine 


s40 


•b'OOOlOOOOOO 


/» new addmatch */ 



/** Logic Equations **/ 
nopass = st:s35; 

68kacc = St:s36 ,lt *st: s37 « st:s38 P st: s39; 

cep = -st:sl # st:s4 # st:s7 <f sf.alO . 

» St:sl4 tt st:sl7 <» ,st: s20 st: s2o 
st:s29 St:s32 « St:s37; 



cep_inv = cep; 
clrtsa - st:s32; 

cmdcdl2 = £t:s23 #st:s24 4* st:s25 » St: 525a; 
c»dcdl4= = srtsl n stcsla « £t:slb tt st:540: 
cdbufen = q9; 
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What is claimed is: 

1. An apparatus for transmitting information in pack- 
ets between at least first and second data communica- 
tion systems, each of said systems comprising a plurality 
of nodes» each of said nodes being adapted to receive 
and transmit packets of information, said packets of 
information each comprising destination address infor- 
mation, source address information and data, the appa- 
ratus comprising: 
means for receiving a packet from the first of said 

data communications systems; 
means for capturing destination address information 

from said received packet; 
means for storing said captured destination address 
information; 

means for comparing said captured destination ad- 
dress information to previously stored source ad- 
dresses; 

means f r capturing source address information from 
said received packet; and 



means for storing said captured source address infor- 
mation; 

wherein the comparison of said captured destination 
address information to previously stored source 
address information is completed before capturing 
said source address information from said received 
packet; and said packet received from said first data 
communications system b transmitted to said sec- 
ond data commonications system only if said com- 
pared destination address information is not found 
by said means for comparing said captured destina- 
tion address information to said previously stored 
source addresses. 

2. The apparatus as claimed in claim 1, further com- 
prising: 

means for compariitg said captured source address 
information to previously stored source addresses; 

wherein said captured source address information 
will be added to said previously stored source ad- 
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dresses in said means for storing if the captured 
source address is not contained within said previ- 
ously stored addresses. 
3. A method of transmitting information in packets 
between at least first and second data communication 
systems, each of said systems comprising a plurality of 
nodes, each of said nodes being adapted to receive and 
transmit packets of infonnation, said packets of infor- 
mation each comprising destination address informa- 



tion is not found by said means for comparing said 
captured destination address information to said 
previously stored source addresses. 

6. The apparatus as claimed in claim 5, wherein each 
of said subsystems are local area networks. 

7. The apparatus as claimed in claim 6. wherein each 
of said local area networks complies substantially with 
IEEE standard 802.3 for Ethernet networks, 

8. The apparatus as claimed in claim 6, wherein each 



tion» source address information and data, said method 10 of said local area networks complies substantially with 



comprising the steps of: 

receiving a packet from the first of said data commu- 
nications system; 

capturing destination address information from said 
received packet; 

storing said captured destination address information; 

comparing said c^tured destination address informa- 
tion to previously stored source addresses; 

capturing source address mformation from said re- 
ceived packet; and 

storing said captured source address information; 

wherein the comparison of said captured destination 
address information to previously stored source 
address information is completed before capturing 
said source address information from said received 
packet; and the transmission of said packet re- 
ceived from said fint data communications system 
to said second data communications system is exe- 
cuted only if said compared destination address 



IS 



20 



25 



IEEE standard 802,5 for Token Ring networks. 

9. The apparatus as claimed in daira 6, wherein one of 
said local area networks complies substantially with one 
of IEEE standard 802.3 for Ethernet networks or IEEE 
standard 02.5 for Token Ring networks, and the other 
of said local area networks complies substantially with 
the other of said IEEE standard 802.3 for Ethernet 
networks or the IEEE standard 802.5 for Token Ring 
networks. 

10. The apparatus as claimed in claim 5, said system 
further comprising: 

means for comparing said captured source address 
information to previously stored source addresses; 
and 

means for adding captured source address informa- 
tion to said previously stored source addresses if 
the captured source address is not contained within 
said previously stored addresses. 

11. An attachment for a first data communications 



information is not found by the step of comparing 30 system adapted for receivmg data packets from a sec- 
said captured destination address information to ond data oonununication system and for transmitting 
said previously stored source addresses. data packets to said second data communications sys- 
4. The method as claimed in claim 3, further compris- tern, each data communications system comprising a 
ing the steps of: plurality of nodes, each of said nodes being adapted to 
comparing said captured source address information 35 receive and transmit packets of information, said pack- 



to previously stored source addresses; and 
adding said captured source address information to 
said previously stored source addresses if the cap- 
tured source address is not contained within said 
previously stored addresses. 40 
5. An apparatus for transmitting information in pack- 
ets in a data communications system including first and 
second data communication subsystems, each of said 
subsystems comprising a plurality of nodes, each of said 
nodes being adapted to receive and transmit packets of 45 
information to and from the other of said nodes, said 
packets of information each comprising destination 
address information, source address information and 
data, the apparatus comprising: 
means for receiving a packet said first subsystems 50 
means for capturing destination address information 

from said received packet; 
means for storing said captured destination address 
information; 

means for comparing said captured destination ad- 55 
dress information to previously stored source ad- 
dresses; 

means for capturing source address information from 

said received packet; and 
means for storing said captured source address infor- 60 

mation; 

wherein the comparison of said captured destination 
address information to previously stored source 
address information is completed before capturing 
said source address information from said received 6S 
packet; and said packet received from said first 
subsystem is transmitted to said second subsystem 
only if said compared destination address informa- 



ets of information each comprising destination address 
information, source address information and data, said 
attachment comprising: 
interface means for receiving a packet from the first 

of said data communications system; 
means for capturing destination address information 

from said received packet; 
means for storing said captured destination address 
information; 

means for comparing said captured destination ad- 
dress information to previously stored source ad- 



means for capturing source address information from 
said received packet; and 

means for storing said captured source address infor- 
mation; 

wherein the comparison of said captured destination 
address information to previously stored source 
address information is completed before capturing 
said source address information from said received 
packet; and said packet received from said first data 
communication system is transmitted to said sec- 
ond data communications system only if said com- 
pared destination address information is not found 
by said means for comparing said captured destina- 
tion address information to said previously stored 
source addresses. 

12. The attachment as claimed in claim 11, further 
including means for selectively interfacing with the data 
communications system through both parallel and serial 
data paths. 

13. The attachment in claim 12, wherein said selec- 
tively interfacing means for interfacing selectively in- 
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terfaces with said data communications system (a) 
through said parallel data path when said first data 
communications network isubstantially complies with 
IEEE 802.3 network standard for Ethernet, and (b) s 
through said series data path when said first data com- 
mimications network substantially complies with IEEE 
802.5 network standard for Token Ring. 

14. The attachment as claimed in claim 10, further \0 
comprising: 



means on said attachment for comparing said cap- 
tured source address information to previously 
stored source addresses; 

wherein said captured source address information 
will be added to said previously stored source ad- 
dresses if the captured source address is not con- 
tamed within said previously stored addresses. 

15. The attachment as claimed in claim 10, wherein 

said attachment to said first data communications sys* 

tem is made by way of a paraUel data path. 

* • * • • 
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PROTOCOL FOR INTERRUPT BUS 
ARBITRAITON IN A MULTI-PROCESSOR 
SYSTEM 

CROSS-REFERENCE TO RELATED 
APHJCMTON 

This qjpUcation is a ocHitimiatiOD of Scr. No. 0S/643»734, 
filed May 6, 1996, U^. Pat Na 3,(S13»12S, wtdcfa Is a 
oontiiiuatlon of Scr. No. OS/049^15, filed 19, 1993, 
abaDdoDed, whidi is a contiiiuation in pert of Scr Na 
08AX)8,074, filed Jan. 22, 1993, U.S. Pat No. 5;i83.904, 
which is a oontiDiulion of Set No. 07/632,149, filed Dec 
21, 1990, abandoned. 

FIELD OTTHBINVENnON 

The pfeseot invention gencnlly iclates to the field of 
nuilti-processar systems and specifically to intccrapt 
controUers desigoed to manage pex^thecal eqoipmoit ser- 
vice interrupt requests in a multi-processor environment 

BACKC3(0UND OF THE INVENTION 

Input/ou^ pedpttnl equqnnent, including such com- 
puter items as printers, scanners and ^splxy devices require 
intermittent servicing by a host processor in order to ensure 
proper functioning. Services, for example, may inchidc data 
delivery, data capture and/or contrcd signals. Eachpmphcral 
will typically have a different servicing schedule that is not 
only dependent on tlie type of device hot also on its 
p rogrammed usage. Hie host processor b required to mul- 
tiplex its servicing activity amongst these devices in accor- 
dance with Ihetr individual needs iidiile running one or meet 
background pro-ams. TWO m^hods to advising the host cf 
a service need have been used: polled device and device 
intorupt mrthnds, Id the former method, each peripheral 
device is periodically diecked to see if a flag has been set 
indiciiting a service request, while, in (he latter method, the 
device service request is routed to an internet contix^cr that 
can internq>t the host, forcing a brancfa fircMn its current 
program to a special intemqpt service TOOtine. The intcrnqjt 
method is advantageous because the host does not have to 
devote unnecessary dodc cycles for polling. Jt is this latter 
method that the present invention addresses. The specific 
problexn addressed by the current invention is the manage- 
ment of intcxnipts in a multi-processar system envircMuncnt 

Multi-prooessor systems, often a set of networked can> 
puters having common poyihcral devices, create a chal- 
lenge in the design of interrupt control methods. For 
instance, in the case of a oofnputer network servicing a 
number cf uscn, it would be hifi^ desirable to distribute 
the interrupt handling load in scnne optimum fsuMoa, Pro- 
cessors that are processing hig^ priority Jobs should be 
relieved of this obligation when processors with lower 
priority jobs arc availsble. Processors operating at the lowest 
priofity should be unifonnly burdened by the intcccopt 
servicing requests. Also, special drcnmstances may reqube 
that a particular VO device be serviced caidusivdy by a 
preselected (or focus) processor. Thus, the current inventi<m 
addresses the problem of optimum dynamic and static 
itttenupt servicing in multi-processor systems. 

Prior art devices, excn^dified by Intel's 82CS9A and 
82380 programmable intcmi^ controllers (PICs), are 
designed to accept a nimober of external intenupt request 
inputs. The essential structure of such controllers, shown in 
FIG. 1, consists of six mi^or blocks: 



IRR- 



ISR- 



bilenapt Request Register a states 
all imninpt lerals (IRQi) on Bon 16 



10 



15 



VR- 



PR- 



hlcnupC ScxTjoe Reftstsr 12 mors 
fill kjCniT^ Icnpds which sb beii^ 
■ervioed, status bring <p ti* l ed upon 
receipt of m enlof-inlengpt (BOD; 
blemiptMssk RegisKr 13 stoses the 
bits mdtcstiqg v^achlRQ tines 16 «» 
to be msAied or dissMed Vr opcntiQg 
ooIRRll: 

Mbctor Regtsten 19, a set of xegisiets, 
one for each IRQ line 1^ aloteft Ifae 



Cootzol Logic- 



TEimbffr nfffibeA. to the host ] 
on data Km 17, oootaittii^ all the 
iE J ttasM i ^ i i i f i sinal ii a i fcr die host to 
sefvjce fiio t eq u es t; 

Mority Ree^ver 15, a )offc block that 
(fetorxniaBs the piknity of the bits aet xn 
DtRll, tbe tis^ prkrity is selected 
wssa scroeea mo iiie cmiBi|w»ining im 
of ISR12 during m iniemvC 
•cknowkdge cycle (IKIA) from the 
host pvoeeesot; 

OoGfdutiiDS die owcniU mteiMiODS of 
tbe other taWfnal Mocks within (be 
same MC^ acttvstes the host ia|Ntf 
iDtemvC (INT) line 19wbeacaecr 
noon bits of IRRll are active, enables 
VR19 to drive tt» internet vector onto 
data bos 17 cbring an IN1A cj^ 
asid JwMW*" flU Iniemqts widi prioiity 
e^pal or lowei tftm flkstbnng 
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Several different mettuxls have been used to assign pri- 
ority to tbe vartoBs IRQ lines Itf, inchtding: 

1) fUUy nested mode, 

2) autcHsatic rotatloo— equal priority devices, made and 

3) sped&c roCatiMir-^^Kcific priority mode. 

The fiilty nested naode, supports a multi-level intenupt 
structure in which an of the IRQ iiipat fines 16 are Biranged 
40 from higjjest to lowest priority: typically IRQ* is assigned 
tike highest priority, ^^dle IRQ7 is the lowest 

Automatic rotation of priorities when the intemqicing 
devices am of equal priority is acoomirfished by rotating 
(dicular sUiting) ttie assigned priorities so that the most 
45 recently served IRQ line is asaigBed ttie lowest priority. In 
diis way, accessibility to intenupt service tends to be sta- 
tistically tevcUed for each of the oc»iq>eting devices. 

Hie specific rotatiOD mettiod gives the user versatility by 
allowing the user to select which IRQ line is to receive the 
lowest priority, all other IRQ lines are then assigned sequen- 
tially (drculaity) hi£^ priorities. 

From the foregoing description it may be seen that PIC 
structures of the type described accommodate uni-processor 
systems with mu^q[)lc peripheral devices but do not accom- 
modate multi-proccss(»' systems with nmltiple shared 
pcripberal devices to whidi the present invention is 
addressed. 



so 



55 



SUMMARY OF THE INVENTION 

60 One object of the present invendcHi is to provide for a 
multi-processor programmable intemqit controller (MPIC) 
system that uses an integrated circuit chip incorporating both 
tbe local processor and an mimed local processor intemipt 
contrcto as a sin^e unit 

65 Another object is to provide a multi-processcr program- 
mable intcoupt coi^Uer (MFIQ system including but not 
limited to the following capabilities: 
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1) nu]lt^le I/O fMriftoaldeWces, each with its own set FIG. 10 shows &e MPIC-ID icgister coQfiguratj<». 

of intemqpts; PIG^ U shows the non-isolated MFIC-Bus connections. 

2) static as well as dynamic nultiiirooesscH^ intemipt hG. 12 shows a tii-state buffered MFIC-Bus anange- 
management inchiding tiie symmetrical distribution of ment 

interrupts over selected processors; s HG. 13 shows the 2-Wt decode i^ocess fcr the MPIC-ID 

3) tevel cr edge triggered intcnupt rcqiuest pins, software used in bus artxtzation. 

selectable per pin; 14 shows flie MPK short message forms. 

4) per pin programmable intenupt vector and steering HG. IS shows the MPIC message encode of the ddivexy 
infonnatioii; njo^ 

5) programmable vector address field defined by eadi fk}. 16 d^es the control hits of the MPIC message, 
operating system; 27 defines the extended delivery mode control hit 

6) intcT-^HOcessar intemipts allowing any processor to coding. 

intanq>t any other fw dynamic leaUocation of intern^ j^jq ^^^^ ^ MPIC-Bus medium and long message 

tasks; and " f omats. 

7) support of syjton wide s^iport functioDs lelated to jpjQ^ ^hows the Base 0, 1, and 2 time ^oatos: 
sJSrSJ^r^ ^ FIG.2tshowsthenivide(Base2)ConfigurationR^ 

The pcesont invention achieves ftesc capabilities by ****^!?^"f"^ ^ ,^ * . 

ineans of an MHCsystaa structure which indudes three 20 ^G. 21 diows the contents of the three tmKsrl^Vecte 

major subsystems: TMt, 

1) an 1/0 MPIC unit for acquiring intcaiqpC request (IRQ) DeSCRJFTlON OF THE PREFERRED 

signals fiom its associated roperii^Kral devices, hav- EMBODIMENnr(S) 

ing a redirection table for processor selection and . ^. ^. , 

vector/priority information; " A muta-processor ivogrammable intenupt controller 

connected to the associated nrocessor or units that am wmuo <us^ dwwi « « »pcA,4A«v 

t;uaucMCM w un> wK»vuai«u iauvc^km sm uiuu uiu aiu numbcr of inxNit olns. Wts, devlces. etc.. in order to nrovlde 

partially <a: totally integrated into the associated ^TTf^ i, TT ^ r m «qcr w jroviac 

Lces^^cadimaLgii^^^ a thorou^ undexstandhig &e 

SJ^c«^I«^^w^^ 30 t*» present inveatitML It will be obvious, howevcz, to one 

cmc system processcar iDCludlng pending, nesting and ^» . . ^ , , , . 

.^—Lrt^^^ « «i«*-7«I^Zl^ i«I!L,«# skilled m the art that the present mventzon may be practiced 

^Sl)^ intcr-piocessorlntcnupc without these g>edficdetJdU. In other instanc^wedl-known 

^ „ . _„ , „ , , circuits have not been shown in detail, or have been shown 

3) a dedicated W> bus, distinct from any ^ystem or in block diagram fam only, in cmUr to avoid unnecessarily 

mcmcoy bus, for ccMnmunications between the I/O and obscuring the wesent invention. 

localMTCi^ Additionally, in desoibing the {vesent invention, refcx- 

Jt is a Jiirther o^ert of this invention to support systwn ^nce is made to ^i^TZes p^ to the cunently 

s«hng gnuuJanty of one ix., one piooc^ at a tm^e p^ened embodlmekRetence^T^ i^edfic name^ 

without sigmficant penalty for any numbcr of processors. ^ould not be construed as a limitation on die^il^ 

BRIEF DESQUFTION OF THE IHtAWINGS ^ of the present invcntioQ. 

A. Overview of Hbc MFIC Aidtutecture 

The present mvCThonnaay be understood more fully firom xhc muM-processcr programmable intcxnq>t controller 

the (tetany dcscrqj^n givenbdow and fiocm the acocra*- ^^^^ designed to accommodate intenupt ser- 

panyu^ drawings of the jwfcned OTbotocnts of ^c vidng in a multi-processc»cnvironmctit Current practice is 

invcntioii, which, however should not be taken to mmt me 45 mainl y concerned with uni-processor systems in which the 

invention to the q>edficaDbodirneiit but am for explanation inleniyrts of a nmhber of peripheral units arc serviced by a 

and understanding only. singleprocessor aidedby aprogrammaWchUenuptcootiol. 

FIG. 1 depicts a block diagram of a common prior art ler (PIC). In a multi-processor system, it is often desfaidile to 

uni-prooessor programmable intemipt controller (FIC). share tiie burden of intern:^ servicing among a group of 

FIG. 2 is a block diagram of the currently ptcfened 50 similar processors. This inpUes the ability to broadcast 

multi-processor programmaUe intenupt controller (MF1EC) interrupt service requests to the pertinent group of proces- 

system. sors and a medianism for determining the equitable assign- 

FBG. 3 is a block diagram of the cuncntly pnfencd ment of the tasks anoongst die processors. The um-psocessor 

VO-MPIC unit design problem is significanlly sinqto: tbeFK) dedicated to 

FK5. 4 shows the various fields that make-«9 a Redirec- processor asrigns a priority to each internet request 

tion lUilc 64-blt entry (IRQ) orders the retpiests aooordittg to the assigned 

FIG. 5 is a block diagram of the cunently ^mtd ft*^^*?».J*f n«^sary Monnation to the 

iocal-MFIC unit. processor to timely initiate the a ppropnate servicing sub- 

routitte. 

J^'^^'^^'^'^f^ 60 The MHC system provides both static and dynamic 

vector table entries of a local-MFICumt intemipt task assignment to the various processors. When 

EK5. 7 shows the various field assignments of the interrupt operatmg in a purely static mode, it functions much as a PIC 

Command Register. in a uni-prooessor system assigning each intenupt according 

FKj. 8 depicts the tracking of the remote IRR bit by the to a prescribed schedule, 

destination IRR bit 65 When <^>erating in a dynamic mode, the MFIC manages 

FIG. 9 is a flow chart depicting the intern^ acceptance intemipt task assignment by taking into consideratioa the 

process by a local-MHC unit relative task priority between processors. 
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It is expected that more typical us^ would cnt&n elc- 5) bus padty cmr intcsxupt gcocratcd l]y any k>cal Mn 

ments of both static and dynamic imemipt managonent onit tbat detects a parity error cm the data bus causing 

Static awrfgnmmts ni^ be made, for examiile, when its bost to be intenupted. 

Iicrnsing considerations prodnde the shared use of servicing The destination of an intennpt can be zfro, one^ or a group 

softv^are. Under other circumstances it may be desiraible to 5 of p roce sso g s in die system. A diffecent destination can be 

lestzict die interrupt servicing task to a suteet of processors q)ecified fbr each internq>t The sender spedfies destinati^ 

that share a common peripheral subsystem. In the extreme of as interrupt in one of two destination modes: physical 

case, all processcKS are subject to interrupt requests from all mode and logical mode. 

per^cral subsystems. In physical mode the destination processor is specified by 

FHj. 2 is a block diagram of the cmrently prcfcned g unique 8-bit MFXC-ID. Only a single destination or a 

multi-processor programmable intctnqjt controUer (MPIQ broadcast to all (MPK-ID of aU ones) can be specified in 

system. The MPIC consists of three m^ units: VO- p^y^ destination mode. 

^S?^^'^*^ EachMFlCunithas ar^ster that contains the unit's Writ 

"^"^J^tT IJ^^J*** ^Ji^^J^ MHC-m. The MHC-mlerves as a physical name of the 

?2rS^S?SSS^^^ 15 MHC unit ft can be used in spcdfy^ destination in^ 

mo -^i^^ madon and is also used for aSig Ae MPlC-bos. The 

TbS^L iT'S^^^ medianism by whic* an MHC establishes its 1^C-ID is 

lt4 mropriately formatted IRQ messages containing all implementation dq>cndent Some m4>lemeatations may 

necessary klenti^ing and trloi^ infoimation. Each local laicfa in the MFEC-ID on some of their pins from slot number 

MFEC unit examines die message and deddes ^edicr 20 at reset time. The MFTC-ID is read-^^ by sc^are. 

to accept it If tentatively accepted by more than one local HksMPIC-ID serves as the physical "name" of the MPIC 

MnC unit IM, an arbitntionpsocedare is invoked between unit used for addressing the MPIC in physical destination 

conq>cting units. The local MPIC unit 104 with the lowest mode and for MFSC-bus usage. 

priority wins the arbitration and accepts the IRQ and timely In logical mode, destinations are specified using a 32-bit 

dispenses it to its associa^ processor lt5. 23 destination field. All local MPIC units contain a 32-bit 

System bus 3t is tlie common means for communicating Logical Destination rcgista 223 against which the destina- 

between processors, memory, and other pei^eral units of tion field of the intearqyt is nuOcfaed to determine if the 

the mnlti-processor system. Each processor and peripheral is receiver is twing targeted by the iittemipC An additional 

interlaced with system bus 39 by means of a memory bus 32-iMt Destini^n Fonnat register 221 in each local-MPIC 

controller (MBQ 3L In prior art systems, system bus 3# ^ mut defines exacdy how the dfstinflti<^n field is to be 

cams intccnopt request traffic, interrupt servicing traffic; and com^Mired against the destination format register. In other 

all odier inter-unit system traffic. The present invention wortb, the Destinaticm Focmat register 221 defines the 

relegates intenupt request traffic to the MFIC bus 193i, iBterpretation of the logical destination information, 

diereby increasing the overall system effideocy. The Destination Rrmat register 221 ptftitioiis the 32-tat 

B. imoxupt Cofitrol 3^ destination information into two fields: 

The intetnqpt control functions of betfa VO and local an encoded tic Id that can be used to represent some 

MnC units are OQUectivelyieflponsfl>le for dclivenngint^^ scalar ID. Matchiitg on the encoded field requires an 

rupts fnm interrupt sources to an intennpt servicing pro- exact Tn*trfi on the value of this fidd. To support 

cesser in a multi-processor system. broadcast to all in logical mode, an encoded fidd vahie 

Each mteirupt has an idratity* the intecn^C vector^ dud ^ ^ ^ £3 treated special in that it matches any 

uniqudy distinguishes tiie inteniqit from ottier intczn^ in encoded value. 

die system. When a processor accepts an intcmipt (IRQ), it ^ decoded fidd (or l»t array) that can be used to 

^^hiTin^S^ ^•^^St^ • ^ Matdiing on die decoded 

S^^iSL^^^ ^S^JS^'v^LH; fiddiequires that at least one of die cooesponding pair 

faitd emb«toeirt 81^^ ^ <tf Wis in the decoded fidds are both ones. 

die ran^ of 0 to 255. DcstinatiMi Format register 221 is contn^Ded by 

Each intern^ has an i^aujj priori^ software and determines which Wts in die deslinatioo infor- 

fivemostsi^<»nt bi^^^ mation are part of die encoded fidd and whidi hits arc part 

priority levds witti 0 bcmg and 15 bdng die highest ^ ^ fidd, Tb have a matdi on die destination, 

pnwity. TTus implies that 16 different vectors may share a j^^j^jj matOL 

siii^cintcrxupt priority 1^ The logical-lcvd intoprctadon of what eadi fidd really 

btenupts are^^^ sources, ,eprescnt8 is totally defined by die operating system. Note 

wmdi may include: ^ ^ consecutive bits and tturt the 

1) external VO devices connected to die VCyMFKl unit of ddierfiddcanbe zero. Azero-lengdifldddways 
manifestedbyeidier edges (levd transitions) or levds 53 matches. Since destinaticm IntHprctation is dwic locally by 
on intenupt inpid pins and may be redirected to aiiy each local-MFICunk, the Destinatfon Format registers of all 
P^^xxsscr; MP!C units in a system must be set up identicaPy. 

2) locally connected device intcnupts.alw^sdfaecled to TiMec exanvie usage modds using differ^ intcfpreta- 
tfae local processor oidy, manifested as an edge or levd tions are described next to furdier illustrate the destination 

60 specification mechanisnL These are probably the most com- 

3) MFIC timer interrupts, generated widun die local- mon modds used in practice. 
MPIC unit by any of die three programmable tiiners; EXAMPLE 1 

4) intcr-proGesstt' intern^ addressed to any individual 

processor of groups of processors in siqjport of soft. Single-Lcvd Modd 
ware sdf inton^, pre-enqitive sdieduUng, cache 63 

memory table look-aside buffer (TLB) flu^g, and In this modd, all 32 bits of destination information are 

intenupt forwarding; and interpreted as decoded field. Each bit position corre- 
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spoiuls 10 an indivldiul Local MFICimit Bit position 1) intemipt injection is neva rejected; 
coidd oone^nd to physical MFIC-ID, but tliis need 2) inteiiiipts (IR(^) m: never lost; 
notbethecate.1Usscliai]eaUowttliespedflcationof 3) in the case of edge triggered imwrapts, toe same IRQ 
aititiaiy groups oiMPSC lunis smpfy ly settug Ae occimwice Is never delivered mote than once, ic, by 
immber^ bits to one, to allows a 5 deHvaing an intenmit fiist to its focus piocessor (if it 
processors (or locatMH^^ tacitly has one), multiple occunenoes of the same 
sdieme, a MPIC unit is addressed if Its hit 18 set m die intemmt \i*ile die first is pending (semdng not 
destination array. Koadcast to an is addeved by setting conffllcte) are aU raxxded as poiding in the local- 
all 32 d^lination Wts to one. This sdects aU MPK intcrmpt request registci's (IRR's) pending Wt 
units mtbesyston. lo cciresponding to that parthailar intetn^t request; 

EXAMPLE 2 activated intenupts, the stale of ttie VO- 

MFIC's intenupt pin is re-created at die destination 
local-MFIC's DtR pending bit whenever its state differs 
HimK*j«l Model fiom die state of die I/OMnCintBcnQit input pin, die 

lUs model uses encoded and decoded fields of non-K«> destination jocal-MHC only initiating tte same IRQ 

lengttis.Thcencodedfieldr<i>esentsastaticclustacf ?^ 
load MHC units, while a wTpositton in die decoded J^^^*"*"^* "'^''=^1^'** 
fieldident^anindividnallocalMFICumtwiddndie JS^J''^IZ.^°^°^^S^ ^J^J^ 
chista:AiUtr»ysetsof{»>c»>son>widdnachi.tercan 20 JT^"!, ^ ^^if S !f*^!^ ^ 
be q)edficd by naming the dusts and setting die bits die datinatfon processor fixal stefac mode and dynamic 

inftTderadedflddfo ^"^^^^IT^ 7^" "^ ."^.^^f^f 

duster. This supports systems widi more tiian 32 ^^^^T^ ™^ by foU«»™«g 

processors, and matdtes a DASH-style duster ardii- inionnatioii. 

tectorc. Bioadcast to an is adiicved by setting an 32 23 1) MFTC-IDX known by each MPIC unit, 

destination bits to one. This guanmtees a matdi on an 2) Destinaticm Address fidd from die I/O-MFIC's 

dusters, and will select aU MFlCs in eadi duster. re-diiection table, 

3) die MFIC unit address; eadi MFIC unit knows its own 
address, 

4) wbetiber an MFIC unit is cunently the fbois for die 
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BiflKxlal Model intenupt, and 

Eadi value of the encoded fidd is the ID <tf an individud ^.^''^P? *^^"/''Sf^*^i ^ ^ 

locd-MHC This ID could be identiad to the MHC s ^^L^Jt^J? .'S^^^ fSSE^J" 

physicn] MFIC-ID, but dkis need not be the case. Badi 33 unoonditionaUy biowtowt by toe VO MHC to an MPIC s 

Wt in the decoded fidd roiesents a pcedefined gioua ^ destination addr^ field for the parucuiar 

This sdione allows addi^ a ain^e MPIC unft^ J!^^^ fj^ ^S^^tl^^'J^^^ 

using its ID in the ews^flddl^ sdecting no ^^^^^^J^ ^ ^^J^^^''.^JZ'^J^ 

groijs), or to addiess a gioup (or union of gioups) of *vaflabic tl^ urtmg* i» ^J»^P™^ ^ *^ d«tmation 

MEICs by «tdng die hooded fidd to an^^cfand 40 P«>c««'s load^C u^ pnonty is low 

sdedfaig tfiepi™^ in die decoded Add. Bad. MHC enough to to^dielo^^ 

unit couU be a mcnibcr of mnh^ple gioups. Supporting pwcessoL Rxed rcndiicdion results in: 

broadcast to an in the bimodal modd lequiies that 1) distzibution acxms all fsooesson; and 

scrftware define a group diat contains ail local-MFICs in 2) assignment of a specific local-MPIC to a given inter- 

the system. Broadcast is thea achieved by setting an 32 45 rupt 

destination bits to one. This matches tU individual IDs Rxed rediiectjon anows existing single threaded device 
and also nutcfaes on the group that contains 811 local drivers to ftincdoo in a multi^ffocessor environment pro- 
units, vided that software binds the (kivcr code to run on one 

Each processor has a processor priority that indicates the processor and the MPIC unit is programmed for fixed 

idativc inqxiitance of the task cr code that the processor is 50 ddiveiy mode so du£ the device's intenupt is directed to the 

currently executing* This code my be part of a process cr same processor on which the driver runa, 

thread, or it may be an intemqit handieL The priority is The lowest-priority le^dkection nkode causes die lowest 

dynandcafiy raised or lowered widi changing tasks thus priority available processor in a ffoup q^edfied by the 

masking out lower priority Internqits. Upon scmangcrf an re-direct address fidd to service the intern^ Because eadi 

IRQ, die processorfctuns to a previously interrupted active. 55 <^ these lowest-iadority processors local-KfflC knows dieir 

A processcH' is lowest pcinity within a given groi^ of associated processor's priority, an azhitnitioD protocol is 
processots if its processor priority is die lowest of att exercised on the MFEC-bus to detennine the lowest priority, 
processofs in the groi^. Because one or more processors If more than one processor is operating at die lowest- 
may be sinudtaneously lowest priority within a given group, priority, then one of them my be pidced at random. An 
availability is subject to die process of arbitration. 60 adtfitional processor selection algorithm is applied to the 

A processor is die focus df an intemqit if it is currently remaining candidate lowest priority processors for the ran- 

servidng that intenupt, or if it currendy has a request dom selection of a processor with the object of uniformly 

pending for duit intern^ spreading die intenupt servicing task amongst the lowest 

An impoitaat feature of the current invention is the priority processors, 

guarantee of exacdy-once delivery semantics of interrupts to 6S C Structural Description 

die specified destination which implies the following The I/O-MFIC unit 1%2 of FIG. 2 is furdier detailed in 

attributes of the Interrupt system: HG. 3. The interrupt iqnit lines lf7 provide the means for 
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the VO devices to inject dicir mteEnQ)ts. An edge filter 198 
is used to provide dean levd transitions at the iiqut pins. Ibe 
le-direction table l$4 has a dedicated 64-t)it entiy for each 
internet ii^ pin (Une) lf7. Unlike the prior ait IRQ pins 
of the S2C59A/S2380 FIC previously discussed, the nodon 5 MMfc(l6> 
of intenupt priority is conqtletely unrelated to the position of 
the physical iDtemipt Input pin on the K) MF1C unit of the 
cuncnt inve&tian. The pricnty of each ispat pin lt7 is 
software programmable by ass^idng an 8-bit vector in tfie 
cofzespooding entiy of the Re-direction Ihble 1#4. lo 

FiG. 4 shows the forinat of eadi Re-diiection Table 
entry. The descrqjtioa of each entry is as follows: 



10 

-continued 



(32:63): 



0 * 9(^fD mtsitivc 

1 - IbtoI wiobUiw 



\bctor(0c7): 
Delnviy Mode (8:10): 

ooo- 

001- 
011- 



AnS-fab&U 



A 3-bit fiftld dMt spedfiM bow the 
local MFKTs fisted m the destkatioa 
fldd ^^'^^''M act iQmi BBCft^pt oT dtit 
ijgiMilt baa fcUowiqg moaniog^ 
Rscd- - dolfwt Id aU pcoccjjofi batDd 



ICQ- 



tOl- 



110- 



111- 



Loweat Morily • dativer to kyirest 
vQiaO 
t indtMmiliOD. 
RanQoto Reod - jaquaat ccwtBrta of an 
ftlFK unit "^'f wfaoao arirtiwia is 
In Ab >fectDr flelc^ to be i&jfed ki (be 
Remote lUtgiiter far a c e a ae by the 

NMI - ddiver to die noo-maakable 
aMemqit (NMI) pin of all liated 



-defirer toaD 
by isaBrtkigAib-ttMxtina dis 



Dri)iK - delhrav to all EilBd iRxcaaocs 
by Maeftii«MM«eitii« tbe local 
MnC*ft debqg pm; txaated ae a hrvel 



Bxt INT . deUy«r to ^ INT pto of all 



8259A oonqwt3)fe iirim mit cxntniki^ 
treatod aa a knpal etmitrw wanal 
(Note that DeUvering Modea of React, DebiQ ndBstlNTaiooot VO 
Anricc ninnqit rolrttid. React and Debug bMspsDocaaor bdcnu^ca 
wfaife dw Exupri mode te ncbicfed to ptovide conpaAMH^ wilfa fiv 
coktiqg 8259A FSC d» fivto atandoid) 
PrjtinitJOD Mode (ll)i bitcrpnli Deituutkjafidd: 

0 - FtQviad Mode • naa MFKMD 
m bite 56:63. 

1 - Logical Mode . 32 bit fibid aa 1t» 
logical ifcjiinition operating 
lyitein ^^^''^^^ 

A 2-bit aoftware icad osfy field 
ooobuflsng cuffcnt dclivciy etatua of 
die intem^it: 
Idle • no cment acuvuj^ 
Send Pending * uitei i u p t uy e cte d to 
local-MnC, held by other hgected 



DeUvoy Stains (12): 



RcanolB IRR (14> 



Tbis bit ia a uttwaie ceadoB^ ie^ 
3it-bit aoftwe wiilea to die 
jBMdiaFcticm table 109 do not aSect 
due bxL. 

MhfDta Ae btoznipt R c tiueat Ri^^iitw 
(IRR) bit of file deatinatiOB toeal^MraC 
for level eensitm iuleuii|itB ojiiff^ and 
when Btitei of bit c&^gnBs with die 
^if^tiw cif tbe f^^Tp^^^^^^^^g ^QSeiv^pt 
ioput fine 107, an VD-MnC iwwanQp 
is sent to make die dMtinatiop'a IRR 
bit leflect die new itate ^**'**"g die 
nmoto (k)oal-Bffl1C> IRR bit to (zaek. 
Ibis tnt b software leadKwIy. 



15 



0 - ma madoed Intemqn (NMI) 

1 - inaAed ntenupt c^iable of besQg 
b b c be d by ^*fg**** pnofity taeikai. 
SA-bn field w|tf jfiijdm lifexni^ 
d Bittiwtto o dfftn a d by tbe i^jfftnting 
arBkm Ifae lower part of HO. 4 
depicts Ifae two poutble fbnmts 
prevtooBlfy diacuaaed: a 32-bit 
pbyvcal (d ecoded) fonnat using one 
bb pef piDoesBor and an 
8[24 bit logical finmot with 8 coded 
bits and 24 decode d bats a 
256 x 24 twoc 



20 



30 



The 64-bit wide re-diiect table 119 is lead/wiite acces* 
aible ttuou^ die 32-hit address and 32 data lines^ DATA/ 
ADUR of a hostpvocessor, cxcqA as noted above for 
tbe Deltvcfy Status and Remote IRR bits which are hard- 
ware wdte and software leed only. 

The rcHitrectioa tabie entries arc focmattcd and broadcast 
to all k>cal-MFIC units 1«4 by MFIC-bns sendAreoeive unit 
111. The MFIC-bos 117 pcotooc^ ipedfies a 5-wire synchro- 
nous bus, 4 wires for data and one wire for its dock. Specific 
details of message fosmats wiU be covered under the section 
on MITC-Bus lYotocol. Acceptance results in the reset of the 
Delivery Status to Idle. 

The local-MFK unit is responsible for intemqit 
acceptance* dispensing of intcnn^ts to the processor and 
sending inta-processor intcoupts. 

Dq;»endiQg ontfae inteorupl detivciy mode specified in the 
intmupt's re-dxrecticnal table entry, zero, one or nkore 
35 MFECumtsmay aooqjt an intern^ A local-MFIC accepts 
an intemipc only if It can deliver the intenupt to its associ- 
ated processor: Accepting an interrupt is purely an I/O- 
MFIC 112 and local-MPKZ IM matter while dispensing an 
intemqit to a processor only invcdves a local-MFlC 1#4 and 
its local proc es sor 145* 

The Re-direction IWe if9 of the rO-MFIC unU 102 
saves to steer intoruptB that originate in die I/O sub-system 
1#1 and may have to be directed to any processor by 
broadcasting ttie Re-direction Ibfale entry corresponding to 
a given iasempi over the MF3C-Bus 1I3l 

FIG. 5 details the s&uctural dements of the local-MPIC 
unit IM. The Local Vector Ibble 219 is similar in iiinction 
to the I/O-MFIC Renliiection Thble 109 auxpit that it is 
restricted to intemipts relating to the associated local pro- 
50 cesser only. Local Vector IbtUe 21t contains six 32-bit 
entries. Kitries 291 through 2ft2 correspond to timers 0 
through 2; entries 293 and 2M coaeq[>ond to local intEmipt 
input and entry 2*5 controls interrupt generation for 
data parity errors. The higher order bits in timer entries 20% 
55 through 2i2 contain timer-specific fields not present in the 
other entries (as detailed under the later discussi<Mi on 
timers). 

Althous^ FKjS. 2 and 5 show local MFIC unit IM as a 
separate entity.^ it may t>e incorporated in whole or in pert 
(0 into the associated processor or processor chip 115. This 
may t>e done in order to In^xove the efficiency of commu- 
nicatioos between the local MPBC unit 104 and the associ- 
ated {Rooessor liS. For example^ the incocpcHation of the 
local MFIC unit*s local intcmqit vector table cQmprising 
units 213 and 2M could provide a more direct path to cache 
memory and thus speed up the flushing of a cache memory 
translation look-up buffer. 
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(OtT): 

Delivczy Mode 
(DSLVXSaO): 



FIG. 6 defines tbe various Adds associated with local 
vector table entries 290 through 2#5. 



An 8-bit field conftinfpg Ifae mtCR^il 
^feetot 

A fidd favrnig die sime mryning 
MS in n&^UnctioD tibfe 109 cotccpl 
kwoft priaritf (001) is lynaaymous 
with filed (ODD). 

Thift bit ndnon fiw iiilBfiint's IRR bit of 
flu* local-MnC ubbL B b oMd Mle^ 
kwel tiisgered local iiiiEatn9<i, b 

iDtcnupta, tad is fofiwsre md^only. 
TUggfit ModD (IM) (15> 0 ittficalM cdae temitfve tEUicr. 1 
H?^^**" kfvd mnfire iiiuiupt. 
Local 'nkavft piot (203, 204) mMf be 
proftmnied at odicj edge or knnel 
triageiBd labile Unocr (300:202) and 
PinQr (305) an alw^s adta 



out (R) (14): 



t(BBXl«): 
Mode(MX17): 

BaoD (18:19): 



0 ffipKW iirtfiin^rt, 1 nMute inifiTiyt 
Select* mode of timen 0 bene ifaot, 

1 b pcriodSc 

S c hc ti ODD of dvee tinio baae tot 



(Mode and Base parameters will be discussed further in 
die sectbn on Hmcr Architectmc.) 

A processor genentes inter-processor internqiCs by writ^ 
ing to die 64-bii interrupt Command Register 220, whose 
layout is similar to that o#the VO-MFIC Re^directlon T^le 
199. The prpgnumnable format, very similar to an entiy in 
the Rc-directioDlUxle 199 is shown in FEO. 7, allows every 
processor to generate any interrupt tttcreby allowing a 
processor to f<Hwaid an istcnupt origiiially acoqjted by It 
other processors. This feature is also useful for debugging. 
Tbe Interrupt CommandRegister 229 is software read-write. 



95 



DeUvierT Mode (8:10): 



IdcBtifiei 



»(11): 
I>eliwry Ststw (12): 



Lml De-MMrt (14): 



Sane uueipntatioa ae fiv 
Rfr^finctkm'Ubte 109. 
Same imeipretatica ai fcr 
R»<dinctioa'Uble 109. 
Same iiU w p ic lati on aa to Be-di r p rt ioo 
IMUe 109. Local ptoocaoor aett 
swniy kcaU^nC 19-dBtca. Soffcwara 
WKj scad due fiold to find ovl if die 
iiSanyt bae ben sent aodt If bo» 
InaBomit f iH Kc'jblfi 220 la 
leady &r aoorptiag a new atmvpL If 
rpgisKf 220 b ovciMvrittni befcre tiie 
deUveqr b Idb (0), dken file 
statoa of l^at fflites^^ipt la ondcfined 
(iney of im^ not have beeo aixciAed). 
A bit iieed itt co^pBctioD wifii IKsger 
Mode (15) to eamdite 
aseettionM^aneftkin of Icvd 
Meiiiitivo iottnaftt (0<lfria8aeft^ 
I-aaecft). For cssnpb, with Deltray 
Mode at Ro»ct, l^iSSBT Mode St Lcrd 
md Letwl De neurit at UfeaotisiiLa 
dc aeecft of Reset tothe pBoeeeeorcf 
tl» adtaaed MFK:(s> Ihb 
oosid^ai will abo cauee ell llFK)i to 
reset tfaeir Artxtretioo-ID (need fir fie 
bivtkii^ B lowost prioriQr ai bitiati OD) 
to dkeMnC-ID. 

Same as for Re£ieetiMi 1U>b 109. 
RoDoto RcmI Status (16:17) Mdicates statu of data oootaiaed in 
die Remote Read nrjiwtrT 224: 
00 - InrmlkJ - coflteat of RexDOte 
Read regster 22* inralid 
xcoolB MHC unit unabfe to 
deliver. 
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-continued 



01 - In Prpgiesa - Remote Read ia 



(18:19): 



10 



13 



20 



23 



30 



(32:63>: 



10 -MOid- Remote Read 
cnnipbte^ valid data. 
A 3-Vit field OBcd to specify a 
deatkiatioD wilbcut fiie need to 
troide tl» 32-bit destinafion field. 
Ibb laduoTi aoftvare ovediead by 
not lequiiing a s ecopd 32-bit wiile 
cyiiatiuQ c op c^ po p d a^ g to bit field 
32:63 fcr the follow mrmnnn csm: 

a) fluClwaiB self inteiuiytf 

b) hiPwmp^ to a single fixed 



fiiat 



Tbe 2-b(t code b flntetpiated aa 

00 • No daatltaiil, nee D eatinatin p 
field (32:63). 

01 - Self; cvTCot loeal-MnC btbe 
GDI7 deatiualiou (iiaed fisr 
auftwan bttnupta). 

10 - AH including e^ 

11 - AD coKtnding acU^ naed dunng 
zoaet and riftwig, 

Opecaling lyslem defined, same aa for 
Retfiwcfion'ttble 109.Uaedop» 
wlten Deatbafion Sboflhand b aet to 
]>e8tFbld(D00). 



c) intempt to all i 
Deatisafion field (32:63X 



IHg^ Mode (13): 



The ro-MFIC unit 102 and an local MPIC units IM 
receive messages via die MPIG-Bas W, The MHC unites 
firm check to see if it belongs to the destinatbn in the 
message. For example, in Ihe case of the 32-bit destination 
format previously died* each MFIC unit with an ID vahie in 
MPIOTD Register 222 less than 32 uses it MFIC-ID to indn 
into the 32-bit destination airay. If it finds its bit set, then the 
MFIC unit is addressed by this message. In the case of the 
S><24 foimat, each MPIC unit checks if its MFIC-n> is equal 
to die MFIC-ID in the 32-tit desttoatioo field, cr if it is a 
member of the group list, as diown in FfQ. 7, by bit-wise 
ANDing its 244nt groi^ list register (32:55) witfi the gnN^) 
list in tbe message and ORmg all resulting bits together: If 
the MFIC-ID in the message has a value of 255, then tbe 
MFIC unit is addressed by the message as welL 

MFKMnis sendAreceive and aibitraticHi unit 226. FIG. 5, 
directs the destination and mode information on output 267 
to accqjtance logic unit 248 which perfcoros the logic 
operations in conjunction with the contents of MFIC-ID 
5Q Register 222. the message is accepted, the vector infor- 
mation available on output 266 of unit 226, is decoded and 
together widi the mode information is passed on lfaroug)t to 
the 3x256 bit vector array 23t by vector decode unit 22S. 
Tbe 8-bit iotexxupt Vector, ^i^ien decoded by Vector Decode 
228 d^ermines which tnt position out of 256 possible is set 
indicating die intermpt priority. When an intorupt is being 
serviced, all equal or lower pricrlty interrupts are automati- 
cally masked by the prinritizcr unit 24Il 
3x256 bit Vector Array 23t consists of 256 bit vectm 
^ used for storing interrupt related information. Each register 
is software road-only and hardwoe roadAvritc The registers 
are defined as follows: 



ISR, InScmee Rpgbter 231, afaowa 
63 nUenmpts that are cuRcady in ecfvkc 

fbr wtndi no cixl of inieinipt (BOl) has 
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locd MPBC unit bu not diiymiwi to 






IMR« 


TH»ec Mode lU^istBr 2H bufieites 




wbetfaer inlcanqjit it ■ kvd or e^ffc 
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-coatiDued noo-zoo in Ifae duce Isb be used to desaibe a task 

sdieduUng class between • (liUe) and 1 for the pmpose of 
asdgning an intenupL Tlus is pacticulariy useful vrhtn a 
number of processors aie operating at die same loiwest level 

5 ofpdocity. 

A processcr's priority is derived firoan flfae ITR 242, ISR 
23 1 and IRR 232. It is tfa e maximum of its task priociQr, and 
tbe piiosity of die highest order ISR Ut, and die priority of 
the highest IRR bit, all evaluated using Che four most 
jQ significant bits of their coded ^bit representation. This 
value, used in determimng availability of a local-MFK to 
accept an intcmict and in ^U^fpftin^ng the lowest priority 

H^«L^i^^n?^jS3^ ^ local.MHC accepts an intcmipt. it guaiantecs 

ifrtJcndiei^iVtgoesirsavicc(ISRhitsct).Inst^ amaslaWemternvtisconti^byftelOT/IOT^^ 

IRR bdt mtrrars die state of d« inteoupt*s it^mt pin. As which begins wiHidielocal-MFIC unit asserting die INT pin 

previously <fiscusscd, when Ac level tdggotsd intetmpt is ^ i« cwmecttd to die procwsor INT ^ If the 

de^scrted, die source VO-hltlC detects die discrepancy processor has intemipts enAt^ it wiU respond by issuing 

ami scab a message to die destination local-MPKZumt to 20 an INTA cycle on line 261 causing die local-MPIC to fcceze 

dear its IRR bit tntemal priority stale and release the 8-bit Vectcr of the 

nG.Sshows,by wayof anexan<ilc,hawtficicmotcIRR Va<xty inlanjt wito die processor data ^IJ^- 

and die IRR bit at the destination local-MFIC unit track die Th e proc essor reads die Vector and uses it to find die 

state cf die intesxupt input (INTIN). It else illustrates bow an " ^p^ handler's entiy-point The local-MFIC also sets the 

EOI is followed immediatdy by die le-assotion of die 23 intcnupt's BR-bit The corresponding IRR bit is dcaicd 

mtantpt as long as the INTIN is still asserted by some ^ ™^ 234 indicates an edge triggered intcmipt as 

deviccInthise3wmplc,ili8assumeddiattwodevioes,Aaad previously disaissed. 

B, sham a level triuend internipt input to die I/O-MFK:. ff a kvd triggered intcmipt is de-asserted just pnoc to its 

Device A raises a level intcmipt as Indicated on line (aX IOTA cyde. all IRR bits may be dear cad die Ptioritiz«24i 

followed by a device Bintarupt as shown on line (b)L The » may not find a Vector toddiver to die processor on data bus 

resulting INTIN signal is die ORed combination of lines (a) Instead, die Ptoitizer 24t wll return a ^p«mcus 

and (b) as shown on line (c), TTie MPlC-bus sendfteccive mexn^\&Axx^V)i^^ The di^pcnsu^dT ttie (SI>0 

unit 111 of FIG. 3 EXCLUSIVB-ORs (XORs) INTIN widi not meet I^ 231, so that die intcmqit handler should 

die remote IRR bit, bit 14 of die Re-directioD lUie 1#4. rctuniwidi«rtissuuig anEOLThcSIVispfogrBmmal^ 

shown on line (e) to yield die 'level assert^ and 35 SIV roister witiun {sioritizcr 241. 

de^issert*'shownonline(d).Thelocal^MMCIRRhittiack8 II is possible diat local MRC units exist in the system ttiat 

die state of line (e) as shown on line (f). Line (g) demon- de not have a processor to which to dispense mtemq}ts.The 

stiates how an EOI is followed immediately by re-asscrthig only danger diis represents in die system is that if an 

die intcanipt as long as die INTIN signal is still asserted by intemipt Is broadcast to all processor using lowest priority 

one of the devices. 40 de&vcry mode when all processors are at the lowest priority, 

FIG. 9 is a fkwdiartd«Ticting die inteBnn)t acceptance ^ » diat a k)cal MHC unit w^hoot^ 

process of a local-MPIC unit Upon recdpl cf a message, a processor may accepedieintenuptifdus MPIC unithai^ais 

local-MFIC unit is the current focus, Le., die pertinent IRR to have die lowest Arb ID at die time. To prevent dus from 

crlSRbitispending^itacceptsdieintcmipcindependemof h^ipening, aU local units initi^ in die disabled st^ 

priority and signals die other local-MPICs to abort die 45 ^ c^»Qt^ enabled before dicy can start accepting 

priority arbitration. This avoids muU^e delivezy of die MPIC messages from d^MPiC-bus. A disabled local MPIC 

samcinteiruptoccurrcncetodiffacntptocessors, consistent Unit only reqwnds to messages widi Delivery Mode set to 

witii irior ait intcmqit delivery semantics in um-process<r "Reset", Res<l«eassr it messages should be sent in Physicid 

systems. If a local-MPIC unit is not, cmrently, the focus it Destination mode using die target's MPIC ID because die 

listens for die acceptance by anodier local-MPIClf mare » logical destination information in tbe local MFICs is undc- 

diaDoneMFICisavailid)le,arbftiatioo,asdesciin)^ fined(an zeroes) wtotte lo^^ 

die section cntided MFIC Bus Ftotoool» is invoked to Betw returning fr<ra an intctiupt handler, scfiwarernust 

determine die winner (lowest priority) unit ^ endnctf-imeBrapt (EOI) command to its local-MnC 

ff a message is sent as NMI, Debug cr Reset, then all units clearing the Mghert priority bit in ISR 231, by writing to EOI 

Hstedindx: destination unoMiditiooallyassat/de-assotdieir 55 r^tftcr 246, indi cating die interrupt is no lonyr in service, 

processor NMI oulput pin 263, Wbag pin 264, or Reset phi ^ causing it to return to the neat W^icd pric^ activity. 

265 of FIG. 5. BR 231 and IRR 232 arc by-passod and The MH C qfrt em is imti al i7«t m die foilowmg manner: 

vector information is undefined. ») Each MPIC unit has a reset liqait phi connected to a 

TheTaakPriorityR^ister(TPR)242of FK3.5 stwcsdie common reset Ibe and activated by die system reset 

correntprioiity of its processor's task which is dynamically ea signal 

subject to change tiecause of eapHdt software actions, such b) The S-Isb of tbe data bus lf6 are latched into MFIC-ID 

as when tasks are switched, and upon entering or reniming roister 222; 

from an intenupt handler. TPR 242 is a 32-bit register c) Bach local-MFIC asserts its processor Reset (RCT) pin 

supporting op to 256 priority levels by means of an &-bLt 265 and resets all internal MPlCiegLSters to their initial 

field (#:7). The four msb (4:7) coireqwnd to the 16 interrupt 65 state, Le., Re-direction Tsblc W and Local Vector 

priorities vidiik the four Isb (9:3) provide additional rcsolu- Tabic 219 set so as to mask interrupt acc^itance, 

tion. For example, aTFRvalue widizen>inthefivemsband odicrwise setting register state to zero; 
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d) Each local-MFEC d&^serts it |ffoces8or'& reset pin to 
allow the processcr to perform serf-test and execute 
initializiitioD code; 

e) The first processor to on MFIC-bus 119 will force 
other processors into reset tsy sending them ihe inter- 
processcr interrupt with 



Deiivefy Mode 
Lcwri De-Anert 



= 0 

= AUExclSelt 



all odier processors being kept in reset until the active 
processor's operating system allows than to become 
active; 

f) The only running processor peifcvms most of the 
system initialization and coiiiiguratiott^ eventually 
booting an operating system which sends out a 
De-assert/Rcset signal to activate the other processors. 
D. MUC-Bus Protocol 

The MFIC-bus 1>3 is a 5-wire synchronous bus c<mnec(- 
ing I/O-MPIC and local-MPBC units. Four of these wires are 
for data transmission and aitritration while one Is a dock 
line« 

Electrically, the bus is wire-OR connected providing both 
bus-use arbitration, and lowest priority aibitratioa. Because 
of the wire^Ht coimection» the bus is run at a lew enough 
speed sudi that design-^>eciflc termination tuning is not 
required. Also, dtt bus speed must allow sufiEHent time in a 
single bus cydc to latch the bus and perform some simple 
logic operations on die itched information in order to 
determine if the next drive cycle must be inhibited, WiA a 
10 MHz bus speed, an intarupt requiring no arbitration 
would be delivered in about 23 |is, and with priority 
arbitration, about 3-4 psw 

Hie MFIC units IBl and IM have sqrarate MFlC-bus 
input and output pins which may be directly connected in a 
non-isolated configuration as shown in FKj. 11. Three-state 
input bufifcrs 391 and output buficn 3#2 may be used to 
provide a tdcrardiical connection to MPfC-buscs diat axe 
required to siq)port a large number of processors as shown 
in FIG. 12. 

Aibitration for use of 4ic ^iHC-bus 183 and for deter- 
mining the lowest priority MFIC unit dq^nds on all MFIC 
message units operating synchronously. Distributed bus 
arbitration is used to deal with the case when muh^e agents 
start transmitting simultaneously, hus arbitration uses a 
small number of arbitration cycles on the MFIC-bus. During 
these cycles, arbitration ^losers** progressively drop off the 
bus until ofdy one *Svinner remains transmitting. Once the 
sending of a message (induding bus arbitration) has started, 
any possible contender must suppress transmission until 
enough cycles have elapsed for the message to be fuDy sent 
The number bus cydes used depends on the type of 
message being sent, 

A bus arbitration cydc starts with the i^ent driving its 
MFIC-ID on the MFIC-bus, higher order bits first Mctc 
q>edfically, the 8-bit MFIC-ID (11:17) is diopped into 
successive groups of 2 bits (17:li) (15:14) (13:12) (ll:lt). 
These tuples l(im)0(0. arc then scquentitaiy decoded to 
produce a four bit pattern (B#:B3) as shown io FIG. 13. Bits 
(B9:B3) are in^vessed on the four MFIC-bus lines, one bit 
per line. Because of the wired-OR connection to the MFIC- 
bus, each tuple of the ID only asserts to a singiLe wire, 
making it possible for an agent to determine with certainty 
wticdier to drop-off ('lose*0 or to continue arbitrating the 



next cycle fct the following two hits of the MFIC-ID, by 
simply checking whether die bus line agent diat is driving 
the bus is also the highest order 1 on the bus. Id this manner, 
eadi MFIC-Bus cydc arbitrates two bits. 

5 Atfoitratiott is also used to find the local MFIC unit with 
the lowest processor priority. Lowest^ority arbitratbn 
uses the value of die local MFIC unit's Task FtUxity 
Register appended widi an 84nt AtfaitratioQ ID (Arb ID) to 
break lies in case fliem are noultipieMFICs executing at the 

10 lowest priority. 

Using the constant M»t MFIC ID as die Arb ID has a 
tendency to skew symmetry since it would favor MFICs 
widi low ID vahies. An MFIC*s Arb ID is therc£ore not the 
MFIC ID itself but is derived from it N reset, an MFIC's 

13 Arb ID is equal to its MFfC ID. Each time a message is 
broadcast over fiie MFEC-Bus, 811 MFICs increment their 
Arb ID by one, which gives them a dififorent Aib ID value 
fix die next arbitration. The Arb ID is then endian-reversed 
(LSB becomes MSB, etc.) to ensure more random selection 

20 of which MFIC g^ to have die lowest Arb ID next time 
around. The reversed Arb ID is then decoded to generate 
arbitration signals on the MFIC bus as described above. 

After bus arbitration the winner will drive its actual 
mcs sage on the bus. 4 hits per dock in nibb]e-«crial fashion. 

25 MIK messages come in two lengdis: short at 21 cydes and 
long at 30 cydes. Hie intopretation of tbc first 19 cycles is 
the same for all message lengttis. The long message type 
iqjpends cydesforpruoityarbitraticMi to the first 19 cycles. 
The medium message type only occurs if a oooq^ete atW- 

30 trstioQ is not needed u in case when the winner is known 
prior to arfajtia t ton. 

The short message fonnat is shown in FKl. 14 where the 
first colunm represents die message cyde index (1:19) while 
die ncatt four columns rqxesent the 4 data lines of the 

35 MFfC-bus. 

The first four rows (1:4) represent the MFXC-bus arbitra- 
tion cydes y^hext any row has four entries r^xesenting the 
decoded tuple of die MFEC-ID as previously discussed, Le., 
176 . 176 re{nesents tuple (7:6), 154 ... 154, ti^e (5>4), 

40 etc 

Cyde 5 is the extended delivay mode of die message and 
is inteq)rcted in accordance witti FIG. 15. The bit desi^iated 
DM is die Destination Mode bit which is 0 for Ffaysical 
Mode and 1 f(V Logical Mod& Bits Mt, Ml, M2 are 
45 previously assigned in the Redirection lU>le oi FIG. 4. 

Cyde 6 contains die control bits as defined in FIG. 16. 
The extended delivery mode and control bits, cycles 5 and 
6, together drfrrminr the lei^lfa of message needed and the 
intcipretatioii of the remaining fidds of the message as 
SO shown in FIG. 17. 

cydes 7 and H make-i^ the S-bit intcmqvt vector: 

cydes 9 through 16 are the 32-bit destination fidd. 

cyde 17 is a checksum over the data in cydes 5 through 
16. Hie checksum protects the data in these cydes against 
55 tmnsmission enors. The ending MFIC unit provides this 
checksunL 

cyde 18 is a postamfale cyde driven as lUl by the 
sending MFIC aUowiiig all MFICs to peifbrm various 
internal com^MitatioDs based on die information contained in 

60 the reodved message. One of the computations takes the 
computed checksum of the data received in cydes 5 flirough 
16 and comqwres it against the value in cyde 17. IT any 
MPIC unit conqwtes a different than the one 

passed in cycle 17, then that MFIC will signal an etrcvon the 

65 MFIC-bus in cyde 19 by driving it as lllLffdiis happens, 
all MPIC units will assume the message was never sent and 
the sender must try sending die message again, which 
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mdttde8fe-flifoitntiiigfbrteMraC4»8. In nuiUe timers 2M, 291 and 212. Bach timar can select its 

delivery when the inteonpt has a focus processor, llie focus dock base from one of three dock inputs. Each timo may 

processor will signal this by driving Hit diolng cyde 19. operate in ddxr a one-shot mode or a periodic mode and 

Iliis tells all die otbal^nC units that the intoxupc has been eidi can be configised to intcmqyt the local processor with 
accepted, the arbitration is preempted, and short messi^ 3 an aibitraiy prognnmnatrie vector, 

fonnat IS used. All (mm focus) MFIC units will drive ItH TTielocal-MFIC unit 1»4 has two indq>cndcnt dock iiqait 

in cyde 19. Under lowest piioiity delivoy mode, IMO pins: CLOCK, pin provides the MFK:*s internal dock, 

in^lies that the intemipt cunentty has no focns processor TMBASB provides for an external dock. The frequency of 

and that priority aititration is required to ccnqdete the TMBASB is fixed by tfie MFIC architecture at 28.636 MHz. 

deliveiy. In that case, long message format is used. If cyde In additioOt the local-MFIC contains a divider that can be 

19 is IM* for non Lowest Fdodty mode, then die message configured to divide dtha dock signal by 2, 4^ S or 16, as 

has been acoqited and is oonsidaed sent shown in FK}. 19. Base 0 is always eqpial to CLOCK; Base 

When an MFIC unit detects and iqwrts an cnor during 1 always oquab TMBASE; and Base 2 may dther equal 

the etric cyde, diat MFIC unit win simpty listen to the bus CLOCK orlMBASE divided by 2, 4, 8 or 16. Divider (iCase 

until it encounters two consecutive idle (#H9) Qydes. 2) Configuration Register is shown in HQ. 2t. 

These two idle Cydes indicate lhat file message has passed Softwne starts a timer by prognomning Its 32-bjt Initial 

and a new message may be started by anyone. Ihis allows Count Register The timer copies fiiis value into fiieOnreitf 

an MFIC that got itsdf out oi Cyde on fiie MFIC-bos get Count Register and starts counting down at the rate <ji one 

back in sync with the other MEK imits. count for each time base pulse (Base t, 1 or 2>. Each tiiw 

Cydes 1 through 19 of die long message format am may operate One-shot or Periodic ff One-shot, the timer 
identical to cycles Itlvough 19 of the short message frxmat 20 counts down once mid rcarkains at zero until reprograinmed. 

As mentioned, kwg message format is used in two cases: In Periodic Mode* tte timer automatically rdoads the coo- 

(1) Lowest pdority deHvexy when the intcnupt does not tents of the Initial Count Register into the Current Count 
have a focus. Cycles 29 through 27 are eight arbitration Register. 

Cydes where the desthiatk>n MFIC detennlnefiie The three timers are configured by means of fiieir Local 
one MPIC unit with lowest processor pticdty/Aib ID 25 Vector lUile entries as shown in FKi. 21. The Afector fidd 

vahKL (9^ is AS previous^ described. The Mask i, bit ( 1<) serves 

(2) RemoteReadmessages.Cydes29throu^^areihe Jl^!!^^^^ 

32 fait contem^rfte remotely read^^stCL This teternipt when count le^ 

7^ , Zr^^^^ ^*^7\«iIIi base imwt used by die i* timer: I^Base 9. tl-Base Land 

Cydei»isaDAccq)tC^I>lowe<tfKiaitydeiivay, ^S^^^SJitetS^ 

28 with 1199 (no accept), «*ile ttie winnimt MFIC unit -s^«"«wiiiu rA«.i», wuuemu j^i«ti«^uaB«u^i»uic 

drives im. If cydV»^^rur*«<SlMnC units 35 onlybjrt»«k«lptoc^ 

know tlu« the ii^lu»beeDlco^ ^J^9^ '^Ji>^ ^ ^ <y^*i>^ «^ 

. 7~ ^^,^r^T\ZZl , >«^"H^ rcttislcrs orovides a convenient and processor architectuic 

L'^^I^^J^f S^'^I^^ l^SSr^of p«^g iJ^S^- data. THe 

^^^^'c?!; t^^^^^ oo^STtbeseregiSHS is n!i fitrpwledby the MPICin 
^^ T^f-^^^l^^^SS^^^ ^ any way. These registm are looted in the same physical 
tmnon. 2^mes«ge M 40 address page as the other local MFIC registers, access to 

aU MPTCs except fiie^^ding lonote MHC unit, that ^^^^P^S.^^^^^ 

drivesfiiebuswithUU,ifitwasabletosaocessfiillysupply wcS^f^toS' 

tbejequestBddatainGydes29thiough27.IfQ^2Sreads 45 j jT^^^^^^s^^^^ 

imthedatainC^29fiiioiigh27isoomdd^ L A miiW-p««eMor^ 

othowisc, the data is conrfdered invalid. The source MFIC aphirali^ of pcocwsor^ 

unit dial issued the Remote Read uses Cyde 28 to detomine a system bus, die plurality of processors oommunicatmg 

the state of file Remote Read Statos fidd in die Intemipt ^ 

Command Register (vaHd or invalid). In any case, a Remote so an interrupt bus having wiredOR data lines; 

Read request is always successful (dfiiougji the data may t>e m iiqut^ooQxit QJO) iulcuu^ controller that broadcasts 

valid or invalid) in that a Remote Read is never retried. Ibe an intcrxiqit request on the intorupt tws; 

mason for (his Is that Remote Read is a debi^ femurs, and a |riarality of load intemipt cootroiUcrs coupled to the 

a*i>ung**iemoteMF[Cthat is unaUe to respond should not intemqit bus, each of the local intern^ controllers 

cause the debugging procedure to hang. 53 being assodated with a concipooding processor <^ die 

Cydes 29 end 39 are two idle cydSk, The MFIC bus is MP system, each local intemipt comraUcr being oper- 

ovaflable for eifeding the next message at cyde 31. Hie two aMe to perfonn an aibitmdon pdority sequence on die 

idle cydes at the end of both short and long messages, wiitd-OR data lines to oibitiatc for aocqjtance <^ die 

togedier witti non zero (ie., non idle) encodings for certain intemqpit request, die arbitration priority sequence 

other bus cycles allow an XfFIClMis agent that happens to be 60 iiK^iiHifag ddving a current priority value associated 

out of phase by one cyde to sync bade up in one message wtdi the corresponding processor on the wired-OR data 

sin^y t>y waiting for two consecudve idle cycles after lines of the intetn^ Ixis. 

reporting its checksum error. This inakes use of the fiutdiat 2.TheMPsystemofdaiml, wherein the current priori^ 

vaHd arbitration cydes am never 9990. vahie associated witti die corresponding processor com- 

B. Hmers 63 prises either a cnncnt task priority, a pending highcst- 

The Local Vector l^ble 219 of file local-MFIC unit 104 priority intern^ request, or a serviced highest-priority intcr- 

containsdtreeindependeDdy operated 32-bit wide program- rupt request 
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3. The MP system of dalm 1, whexdn the cuzrent priority 
value associated with the oosrespoiMling processor com- 
prises a maximum value of dlher a caneot task |Hiosity> a 
pendmg higj»st-priority intecrupt request, or a serviced 
highest-jxiority intcmipt request 

4. The MP system of daim h wherein the cuireiit priori^ 
value associated with the ccKrespomling processor includes 
an appended arbitration ID of the local intecrupt controller. 

5. The MP system of daim 1, wherein the intemtpt 
request indudes an indication cf a focas procefisoc, tiie 
indication p re empti ng the arbitration priority sequence. 

6. The MP system of daim 1, wherein eadi of the local 
intemqpt oontrcUers indudes intccnqit bus dvcuitry diat 
drives current priority vahie CHito the internist bus, 
higher-order bits first 

7. Amuhi-processar (MP) system opemtiog in accordance 
with a protocol for handling intenupts, the MP system 
comprising: 

an intem^A bus; 
a system bus; 

a plurality of processors tfiat communicate across the 
system bus; 

a i^urality of intenupc controller units (ICUs), each of the 
ICUs t)eing coupled to the interrupt bus and a coore- 
sponding processor, each DCV including a first register 
that stores a task priority value of the coxresponding 
processor, a second register that stores an identificattcm 
(ID) of the ICU, and logic wtuch receives inteonipt 
messages on the interrupt bus and dispenses interrupts 
to the ccHTesponding processor in accordance with the 
protocolt tiie protocol Including first and second noodes 
of opeiatioiL, 



16,976 
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in the first mode of operation an interrupt request is 
accepted by the KZU when a destination fidd of the 
intecmpt request matches the ID of the ICU; and 

in the second mode of operation die intentupt request is 
5 accepted by ttte ICU what a current prioity value, 
v^dcfa indudes the task priority vahie, is a lowest 
current priority value horn among die pluraHly of 
processors as drtfarminrd by a bus aihitratiott sequence 
having a number of arbitration cycles. 

8. The MP system of daim 7, wherehi the firstmode Is a 
fixed, static mode of operatioa, and the second mode is a 
dynamic lowest-priority mode of operation. 

9. The MP system cf claim 8, wlurctn the cuDtent i»i(xity 
vahie comprises the task priority value of the covesponding 
processor appended with an sriatratimi ID of die ICU. 

10. The MP system <^ daim 9, wherein the arbitration ID 
is equal to the ID of the ICU at a reset state of the MP 
system, die artdttation ID of each ICU befaig incrcoiiented 
and endian-revcrsed each dme a message is broadcast on the 
intexrapt boa. 

20 11. The MP system of daim 8, wherein the intecnqpt bus 
indudes wlred-OR data lines, during the bus arbimicm 
seqpience the logic of the KHJ driving the current priority 
value on the wircd-CM^ lines. 
IX The MP system of daim II, wherein darug the bus 

23 arbitration sequence the logic of each ICU fintherdiecks the 
wired-OR lines to dftmninr whether another processor has 
the lowest cunrent priority vahie; if so, the logic draps-cff the 
bus atfoitration sequence; otherwise, the logic continues to 
arbitrate in a next arbitration cyde, or untQ die ICU d^er- 

30 mines that die current priority value is the lowest current 
pri<Hily value. 

* m m m m 
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